-
Notifications
You must be signed in to change notification settings - Fork 189
Open
Description
""" Creates a vocabulary from a tsv file.
"""
import codecs
import example_helper
from torchmoji.create_vocab import VocabBuilder
from torchmoji.word_generator import TweetWordGenerator
with codecs.open('../../twitterdata/tweets.2016-09-01', 'rU', 'utf-8') as stream:
wg = TweetWordGenerator(stream)
vb = VocabBuilder(wg)
vb.count_all_words()
vb.save_vocab()
In this code, in oder to create a vocabulary, you had been used '../../twitterdata/tweets.2016-09-01'
dataset. But where I will find this dataset? Please let me know.
Please share this dataset with my mail rezwanh001@gmail.com, if it is possible.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels