Skip to content

seanhold3n/mooc-mining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

mooc-mining

Compares the frequency of words from some source to the frequency of all academic texts in the COCA (Corpus of Contemporary American English). More information about the COCA project may be found here: http://www.academicvocabulary.info/. For more about COCA projects, created by Mark Davies at Brigham Young University, check out http://corpus.byu.edu/.

Dependencies

This project uses Maven for dependency management. If you don't have the Maven Eclipse plug-in, you may install it from the following Eclipse repo location: http://download.eclipse.org/technology/m2e/releases/

Dependencies used:

Name + Website Version Description
Weka (API documentation) 3.6.6 Machine Learning/Classification Toolbox
POI (site) (Excel example) 3.7 API for Microsoft documents

About

Statistical modeling and classification of MOOC blog posts

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages