There has been many cases that proved teenagers have less tendency to read newspaper articles. As a fellow teenager myself, I have put an effort to combat this issue by simplifying the process of reading the news easier and more convenient by providing a technique that extracts news summary and and classifying them based on the level of how positive or negative they are. Hoping that this will be of more fun and engaging way to improve the rate of teenage awareness in current affairs.
NOTE: URL of the news website is removed from the source code (dont ask me why)
The project is done with Node.js along with the Puppeteer library and more.
Web scraping because #WhoNeedsAPIsBoiiiii
- Decide on front-end framework to implement initial GUI
- Create a basic GUI
- Scrape URL
- Scrape Date & Author's name
- Scrape Images (Check legal rights)
- CSV
- JSON
- SQL
- XQuery
- Identify positive and negative tokens (Basic)
- Indonesian language SA?
- Identify positive and negative tokens (More Consistent)
- MongoDB to accept JSON
- Test Queries on MongoDB
- Automate entries to MongoDB with Back-end script
optikalefx - https://plus.google.com/+optikalefxx