-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
With respect to the vocabulary program. Here is the starting specification as I see it.
Develop a schema of relational SQL database to store
-
words (all unique)
-
examples of word usage (whole sentences)
-
frequencies of words
Result: file with sql code to create the schema.
Notes:- At this stage it would be impossible to differentiate word forms, but it's ok.
- Consider to use SQLite3 as database, because it's the simplest one, but very popular
- pure xml or json or something like this are not good choices at all for storing and further processing of our kind of data (for many reasons). Some document-oriented (for example json documents) might be considered, but it 's to be a good choice in our case.
Develop a Python program for parsing provided text and adding / updating database.
It should have a simple command line interface: one argument -- path to .txt file
Develop a Python program for querying information about word:
- all known examples of usage
- it's frequency
- it's percentage (among all known unique words)
This program should have a simple command line interface: one argument -- word to look up.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels