nltk
https://github.com/nltk/nltk
Python
NLTK Source
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported38 Subscribers
View all SubscribersAdd a CodeTriage badge to nltk
Help out
- Issues
- Verbnet corpus is out of date
- Importing NLTK breaks multiprocessing
- TweetTokenizer causes UnicodeEncodeError when input string is valid
- Better PunktTrainer
- `word_tokenize` could handled URL's better
- Neither `word_tokenize` nor `TreebankWordTokenizer` matchs the original Penn Word Tokenizer
- word_tokenize keeps the opening single quotes and doesn't pad it with space
- nltk.stem.arlstem.ARLSTem#pref(token) returns None
- Consistent pos argument between wn.synsets() and WordNetLemmatizer.lemmatize()
- RecursionError in PorterStemmer._is_consonant
- Docs
- Python not yet supported