Compares the frequency of words in a given source text to their frequency across all academic texts in COCA (the Corpus of Contemporary American English). More information about the COCA project is available at http://www.academicvocabulary.info/. For more about the COCA projects, created by Mark Davies at Brigham Young University, see http://corpus.byu.edu/.
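To illustrate the kind of comparison described above, here is a minimal sketch in plain Java. It is not this project's actual code: the class `FrequencyComparison`, the methods `perMillion` and `ratios`, the tokenization rule, and the per-million normalization are all assumptions made for the example, and the reference frequencies would in practice come from the COCA academic data rather than a hand-built map.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, for illustration only; not part of this project.
public class FrequencyComparison {

    /**
     * Returns each word's frequency per million tokens in the given text.
     * Assumption: tokens are runs of ASCII letters and apostrophes, lowercased.
     */
    static Map<String, Double> perMillion(String text) {
        Map<String, Integer> counts = new HashMap<>();
        int total = 0;
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z']+")) {
            if (token.isEmpty()) {
                continue; // split() yields an empty first token if the text starts with a separator
            }
            counts.merge(token, 1, Integer::sum);
            total++;
        }
        Map<String, Double> result = new HashMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            result.put(e.getKey(), e.getValue() * 1_000_000.0 / total);
        }
        return result;
    }

    /**
     * Returns source frequency divided by reference frequency for every word
     * present in both maps. A ratio above 1 means the word is more frequent
     * in the source than in the reference. Words that are missing from the
     * reference, or have a zero reference frequency, are skipped.
     */
    static Map<String, Double> ratios(Map<String, Double> source,
                                      Map<String, Double> reference) {
        Map<String, Double> result = new HashMap<>();
        for (Map.Entry<String, Double> e : source.entrySet()) {
            Double ref = reference.get(e.getKey());
            if (ref != null && ref > 0.0) {
                result.put(e.getKey(), e.getValue() / ref);
            }
        }
        return result;
    }
}
```

Calling `perMillion` on the source text and passing the result to `ratios` together with a reference map yields one ratio per shared word. Normalizing to a per-million rate keeps texts of very different lengths comparable.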
This project uses Maven for dependency management. If you don't have the Maven plug-in for Eclipse (m2e), you can install it from the following Eclipse update site: http://download.eclipse.org/technology/m2e/releases/
Dependencies used:
| Name + Website | Version | Description |
|---|---|---|
| Weka (API documentation) | 3.6.6 | Machine Learning/Classification Toolbox |
| POI (site) (Excel example) | 3.7 | API for Microsoft documents |