Correction for text with punctuation and dash #33

hyunjoolee · 2020-01-17T06:34:52Z

I have found that if there are punctuation and dash characters in the text, they are not converted to clean text in text/init.py get_arpabet().

For examples, words like "recommendations.", "fbi," and "policy-making" are not searchable in the cmu_dict.
I think these will reduce model performance.

So I suggest some code as attached.

ava added 2 commits January 17, 2020 23:08

Correction for text with punctuation and dash

79a929c

Correction for text with punctuation and dash

ddf8492

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Correction for text with punctuation and dash #33

Correction for text with punctuation and dash #33

Uh oh!

hyunjoolee commented Jan 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Correction for text with punctuation and dash #33

Are you sure you want to change the base?

Correction for text with punctuation and dash #33

Uh oh!

Conversation

hyunjoolee commented Jan 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant