Added the options to use a custom POS-tagger, define custom stop words, and exclude certain spaCy pipeline components. This release solves issues #2 and #7.
Higher compatibility with available spaCy pipelines
Fixed issues #10 and #11 by removing the default exclusion of certain spaCy pipeline components. This slightly slows down keyphrase extraction. However, it makes the package compatible with all available spaCy pipelines, including the ones that use transformers.
Changed the "multiprocessing" parameter to the "workers" parameter
Signed-off-by: Tim Schopf <tim.schopf@t-online.de>
Added min_df and max_df parameters, added support for documents that have more than 1,000,000 characters, and limited the maximum keyphrase length to 8 words to prevent memory issues.
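
As a rough illustration of what document-frequency bounds and a length cap do, here is a minimal, standard-library-only sketch. It is not the package's implementation: the function name `filter_by_document_frequency`, the `max_words` argument, and the assumption that `min_df` and `max_df` are absolute document counts are all hypothetical choices made for this example.

```python
from collections import Counter


def filter_by_document_frequency(doc_keyphrases, min_df=1, max_df=None, max_words=8):
    """Keep keyphrases whose document frequency lies in [min_df, max_df].

    doc_keyphrases: one list of keyphrase strings per document.
    min_df / max_df: assumed here to be absolute document counts.
    max_words: hypothetical cap on the number of words per keyphrase.
    """
    # Count the number of documents each keyphrase appears in,
    # not the total number of occurrences.
    document_frequency = Counter()
    for phrases in doc_keyphrases:
        document_frequency.update(set(phrases))

    # Without an upper bound, every document frequency is acceptable.
    upper = len(doc_keyphrases) if max_df is None else max_df

    return sorted(
        phrase
        for phrase, count in document_frequency.items()
        if min_df <= count <= upper and len(phrase.split()) <= max_words
    )


docs = [
    ["machine learning", "neural network"],
    ["machine learning", "decision tree"],
    ["machine learning", "neural network", "support vector machine"],
]

# "machine learning" occurs in 3 documents and is dropped by max_df=2;
# phrases that occur in only 1 document are dropped by min_df=2.
kept = filter_by_document_frequency(docs, min_df=2, max_df=2)
print(kept)
```

Dropping very rare and very common keyphrases, and capping phrase length, shrinks the vocabulary, which is one plausible way such parameters help with memory use on large corpora.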