DocumentClassification

Each line in data file represents a text used for farm advertisement. All the stop words have been removed from the texts. In the label file, the label +1 means the corresponding ad is accepted, while the label 0 means the ad is rejected.
For this I have implemented both Logistic Regression and Naive Bayes from scratch, without using any existing packages and functions to predict whether an ad can be accepted or not.

NOTE: I used the bag-of-words model and the number of occurrence of words in each ad as its feature. I assume that the positions of words do not matter and each attribute value is independently generated.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Logistic_Regression_and_Naive_Bayes.py		Logistic_Regression_and_Naive_Bayes.py
README.md		README.md
data_matrix.py		data_matrix.py
farm-ads-label.txt		farm-ads-label.txt
farm-ads.txt		farm-ads.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocumentClassification

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DocumentClassification

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages