
First version of vocabulary program #2

@arwer13


Regarding the vocabulary program, here is the starting specification as I see it.

Develop a relational SQL database schema to store:

  • words (all unique)

  • examples of word usage (whole sentences)

  • frequencies of words

    Result: a file with the SQL code that creates the schema.
    Notes:

    • At this stage it is impossible to differentiate word forms, but that is OK.
    • Consider using SQLite3 as the database: it is the simplest option, yet very popular.
    • Pure XML, JSON, or similar formats are poor choices for storing and further processing our kind of data, for many reasons. A document-oriented database (for example, one storing JSON documents) might be considered, but it is unlikely to be a good choice in our case.
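
To make the schema requirement concrete, here is a minimal sketch of one possible layout, wrapped in Python's standard `sqlite3` module so it can be tried directly. The table names (`words`, `examples`, `word_examples`), the column names, and the `create_schema` helper are all assumptions made for illustration; nothing in this issue prescribes them.

```python
import sqlite3

# Table and column names below are illustrative assumptions.
SCHEMA = """
CREATE TABLE IF NOT EXISTS words (
    id        INTEGER PRIMARY KEY,
    word      TEXT    NOT NULL UNIQUE,      -- every word is stored once
    frequency INTEGER NOT NULL DEFAULT 0    -- occurrences seen so far
);

CREATE TABLE IF NOT EXISTS examples (
    id       INTEGER PRIMARY KEY,
    sentence TEXT NOT NULL UNIQUE           -- a whole sentence
);

-- Many-to-many link: which sentences illustrate which words.
CREATE TABLE IF NOT EXISTS word_examples (
    word_id    INTEGER NOT NULL REFERENCES words(id),
    example_id INTEGER NOT NULL REFERENCES examples(id),
    PRIMARY KEY (word_id, example_id)
);
"""


def create_schema(conn: sqlite3.Connection) -> None:
    """Create the tables if they do not exist yet."""
    conn.executescript(SCHEMA)
    conn.commit()
```

The SQL text inside `SCHEMA` could be saved as the `.sql` file this step asks for. The link table is one way to let a sentence serve as an example for every word it contains without storing the sentence more than once.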

Develop a Python program that parses the provided text and adds its words to the database, updating entries that already exist.

It should have a simple command-line interface with one argument: the path to a .txt file.
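
A rough sketch of what such a parser might look like follows. It assumes a hypothetical three-table layout (`words`, `examples`, `word_examples`) and a fixed database file name `vocabulary.db`, neither of which is specified in this issue. The sentence splitting and tokenizing regular expressions are deliberately naive, and lowercasing words is also an assumption.

```python
import argparse
import re
import sqlite3

DB_PATH = "vocabulary.db"  # assumed file name, not specified in the issue

# Assumed schema; created on first use so the script works on an empty file.
SCHEMA = """
CREATE TABLE IF NOT EXISTS words (
    id INTEGER PRIMARY KEY, word TEXT NOT NULL UNIQUE,
    frequency INTEGER NOT NULL DEFAULT 0);
CREATE TABLE IF NOT EXISTS examples (
    id INTEGER PRIMARY KEY, sentence TEXT NOT NULL UNIQUE);
CREATE TABLE IF NOT EXISTS word_examples (
    word_id INTEGER NOT NULL REFERENCES words(id),
    example_id INTEGER NOT NULL REFERENCES examples(id),
    PRIMARY KEY (word_id, example_id));
"""

# Naive rules: a sentence ends at ., ! or ? followed by whitespace;
# a word is a run of letters, optionally with inner apostrophes.
SENTENCE_RE = re.compile(r"(?<=[.!?])\s+")
WORD_RE = re.compile(r"[^\W\d_]+(?:'[^\W\d_]+)*")


def add_text(conn, text):
    """Store every sentence of `text` and count each word occurrence."""
    conn.executescript(SCHEMA)
    for raw in SENTENCE_RE.split(text):
        sentence = " ".join(raw.split())  # collapse line breaks and spaces
        words = [w.lower() for w in WORD_RE.findall(sentence)]
        if not words:
            continue
        conn.execute("INSERT OR IGNORE INTO examples (sentence) VALUES (?)",
                     (sentence,))
        example_id = conn.execute("SELECT id FROM examples WHERE sentence = ?",
                                  (sentence,)).fetchone()[0]
        for word in words:
            conn.execute("INSERT OR IGNORE INTO words (word) VALUES (?)",
                         (word,))
            conn.execute("UPDATE words SET frequency = frequency + 1 "
                         "WHERE word = ?", (word,))
            word_id = conn.execute("SELECT id FROM words WHERE word = ?",
                                   (word,)).fetchone()[0]
            conn.execute("INSERT OR IGNORE INTO word_examples VALUES (?, ?)",
                         (word_id, example_id))


def main(argv=None):
    parser = argparse.ArgumentParser(
        description="Add words from a text file to the vocabulary database.")
    parser.add_argument("path", help="path to a .txt file")
    args = parser.parse_args(argv)
    with open(args.path, encoding="utf-8") as f:
        text = f.read()
    conn = sqlite3.connect(DB_PATH)
    try:
        add_text(conn, text)
        conn.commit()
    finally:
        conn.close()

# To run as a script, call main() under the usual
# `if __name__ == "__main__":` guard.
```

Note that in this sketch parsing the same file twice counts its words twice. Whether repeated input should be detected is a design question the specification leaves open.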

Develop a Python program for querying information about a word:

  • all known examples of usage
  • its frequency
  • its percentage (among all known unique words)

This program should have a simple command-line interface with one argument: the word to look up.
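
The lookup could be sketched roughly as below. The code assumes a database that already contains hypothetical `words(id, word, frequency)`, `examples(id, sentence)`, and `word_examples(word_id, example_id)` tables, and a file named `vocabulary.db`; none of these names come from this issue. "Percentage" is interpreted here as the word's share of all counted occurrences, which is only one possible reading of the requirement.

```python
import argparse
import sqlite3

DB_PATH = "vocabulary.db"  # assumed file name, not specified in the issue


def lookup(conn, word):
    """Return frequency, percentage and examples for `word`, or None."""
    row = conn.execute(
        "SELECT id, frequency FROM words WHERE word = ?",
        (word.lower(),)).fetchone()
    if row is None:
        return None
    word_id, frequency = row
    # Assumed meaning of "percentage": share of all counted occurrences.
    total = conn.execute("SELECT SUM(frequency) FROM words").fetchone()[0]
    percentage = 100.0 * frequency / total if total else 0.0
    examples = [r[0] for r in conn.execute(
        "SELECT e.sentence FROM examples AS e "
        "JOIN word_examples AS we ON we.example_id = e.id "
        "WHERE we.word_id = ? ORDER BY e.id", (word_id,))]
    return {"frequency": frequency,
            "percentage": percentage,
            "examples": examples}


def main(argv=None):
    parser = argparse.ArgumentParser(
        description="Look up a word in the vocabulary database.")
    parser.add_argument("word", help="word to look up")
    args = parser.parse_args(argv)
    conn = sqlite3.connect(DB_PATH)
    try:
        info = lookup(conn, args.word)
    finally:
        conn.close()
    if info is None:
        print(f"{args.word}: not found")
        return
    print(f"frequency:  {info['frequency']}")
    print(f"percentage: {info['percentage']:.2f}%")
    for sentence in info["examples"]:
        print(f"  {sentence}")

# To run as a script, call main() under the usual
# `if __name__ == "__main__":` guard.
```

Keeping the query logic in `lookup` and the printing in `main` makes the lookup easy to test against an in-memory database.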
