Skip to content

Conversation

@ahmetcetin
Copy link

stopwords updated and added new languages using stopwords iso (https://github.com/stopwords-iso)

@MikeASchneider
Copy link
Collaborator

FYI this project is an abandoned fork of an abandoned fork of an abandoned project.

@ahmetcetin
Copy link
Author

@MikeASchneider indeed, i noticed actually, but it still works fine. the main problem is tokenizer in fact, for other languages than English. Especially it doesn't work properly for languages like Hindu, Chinese, etc. Do you have suggestion for a similar library?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants