Homework 3
Your task is to develop a statistical model that takes a document and classifies it into one of five sentiment classes: 0 (very negative), 1 (negative), 2 (neutral), 3 (positive), 4 (very positive).
- Download the following files: sst.trn.tsv, sst.dev.tsv, sst.tst.tsv.
- Each line in a file represents a document, in the following format:
    line ::= <label><tab><document>
    document ::= <token>(<space><token>)*
- Convert each document into a vector with your favorite method.
- Implement the perceptron and train statistical models using different learning rates. Use the training set for training and the development set for validation.
- Run the best-performing model on the evaluation set and print the predicted output. Save the output to hw3.out, where each line contains the predicted label for the corresponding document.
- Improve your model using different learning algorithms or lexicons if you can. Every submission will be ranked, and the ranking score will be reflected in your grade.
- Write a report describing your approach, results, and analysis. Use the ACL LaTeX template.
- Compress your code and report into hw3.zip and submit it to: https://canvas.emory.edu/courses/29596/assignments/32552
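As a starting point for the file format and the vectorization step, here is a minimal Python sketch. It assumes a bag-of-words count vector, which is only one possible choice of "favorite method". The helper names `parse_line` and `bag_of_words` are hypothetical, and the sample lines are invented for illustration rather than taken from the data files.

```python
from collections import Counter

def parse_line(line):
    """Split '<label><tab><document>' into (int label, list of tokens)."""
    label, document = line.rstrip("\n").split("\t", 1)
    return int(label), document.split(" ")

def bag_of_words(tokens):
    """Map a token list to a sparse count vector {token: count}."""
    return Counter(tokens)

# Invented sample lines in the stated format (not from the SST files).
sample = ["4\tA truly wonderful film .\n", "0\tDull , dull , dull .\n"]
for line in sample:
    label, tokens = parse_line(line)
    print(label, dict(bag_of_words(tokens)))
```

A sparse dictionary keeps the vector small because most vocabulary items do not occur in any given document.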
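For the perceptron step, one possible shape is the multiclass perceptron sketched below. It assumes sparse feature dictionaries as input, zero-initialized weights, no bias term, and a fixed number of epochs. The function names and the toy training and development examples are hypothetical, chosen only to show a learning-rate sweep validated on held-out data.

```python
from collections import defaultdict

def predict(weights, features, num_classes=5):
    """Return the label whose weight vector scores the features highest."""
    scores = [sum(weights[c].get(f, 0.0) * v for f, v in features.items())
              for c in range(num_classes)]
    return max(range(num_classes), key=scores.__getitem__)

def train(data, learning_rate=1.0, epochs=10, num_classes=5):
    """Multiclass perceptron: on each mistake, move the weights toward the
    gold label and away from the wrongly predicted label."""
    weights = [defaultdict(float) for _ in range(num_classes)]
    for _ in range(epochs):
        for gold, features in data:
            guess = predict(weights, features, num_classes)
            if guess != gold:
                for f, v in features.items():
                    weights[gold][f] += learning_rate * v
                    weights[guess][f] -= learning_rate * v
    return weights

def accuracy(weights, data, num_classes=5):
    """Fraction of (label, features) pairs classified correctly."""
    correct = sum(predict(weights, x, num_classes) == y for y, x in data)
    return correct / len(data)

# Toy data (invented): (label, sparse feature vector) pairs.
trn = [(4, {"great": 1}), (0, {"awful": 1}), (2, {"okay": 1})]
dev = [(4, {"great": 1, "film": 1}), (0, {"awful": 1})]

# Train on the training set with several learning rates, validate on dev.
for lr in (0.01, 0.1, 1.0):
    w = train(trn, learning_rate=lr)
    print(lr, accuracy(w, dev))
```

With zero-initialized weights and this plain update rule, the learning rate only rescales the weights, so in exact arithmetic the sweep should give the same predictions for every rate. The rate starts to matter in variants that use a margin or non-zero initialization.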
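For the output step, the sketch below writes one predicted label per line, in document order. The helper name `save_predictions` and the labels are hypothetical. The demo writes to a temporary directory only so that it leaves no file behind, whereas an actual run would write to hw3.out.

```python
import os
import tempfile

def save_predictions(labels, path="hw3.out"):
    """Write one predicted label per line, in document order."""
    with open(path, "w", encoding="utf-8") as out:
        for label in labels:
            out.write(f"{label}\n")

# Demo with made-up labels in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "hw3.out")
    save_predictions([3, 1, 4], path)
    with open(path, encoding="utf-8") as f:
        print(f.read(), end="")
```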
Copyright © 2015-2019 Emory University - All Rights Reserved.
