NYTimes Mining Portfolio

A lightweight, end-to-end data mining pipeline that fetches news articles from the NYTimes API, preprocesses text using Python (Pandas, NLTK, spaCy), and uncovers hidden topics with NMF and KMeans clustering, visualized via insightful word clouds.

Features

NYTimes API Integration: Retrieve news articles with custom queries and date ranges.
Secure API Key Management: Utilizes a .env file to keep your API key private.
Text Preprocessing: Lowercases, cleans, and removes stopwords from article text.
NER & Topic Modeling: Extracts named entities and discovers latent topics.
Clustering: Groups similar articles for deeper analysis.

Language & Libraries

How to use this repository?

Step 1: Clone this repository

Run the command below in your terminal

git clone https://github.com/shoibolina/NYTimes-mining.git

Step 2: Obtain API key and Set-up environment variables

Create a .env file using terminal at the project directory to securely store the API key.

touch .env

Create an account at The New York Times and get your Article Search API Key. Open the previously created .env file and enter your api key as follows:

NYTIMES_API_KEY=your_nytimes_api_key

This file is included in .gitignore to prevent sharing/committing the api keys.

Step 3: Install libraries

Install the python libraries listed in Language & Libraries

Step 4: Run the notebook

Follow through the comments in the notebook and have fun exploring!

Author

Shoibolina Kaushik
_{Master of Science, Computer Science (25G)

Emory University}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
imgs		imgs
.gitignore		.gitignore
LICENSE		LICENSE
NYTimes mining portfolio.ipynb		NYTimes mining portfolio.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NYTimes Mining Portfolio

Features

Language & Libraries

How to use this repository?

Step 1: Clone this repository

Step 2: Obtain API key and Set-up environment variables

Step 3: Install libraries

Step 4: Run the notebook

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NYTimes Mining Portfolio

Features

Language & Libraries

How to use this repository?

Step 1: Clone this repository

Step 2: Obtain API key and Set-up environment variables

Step 3: Install libraries

Step 4: Run the notebook

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages