Feature/hot topic of the day#64

Open
fccoelho wants to merge 9 commits into master from feature/hot-topic-of-the-day

@fccoelho
Member

This feature is a script that calculates the hottest tokens of the day. It works by adding up the freqdists of every article published on a given day. This differs from what is shown in the Media Cloud dashboard (the number of articles mentioning the word): here we calculate the number of times the word has been printed on a given day.
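To make the distinction concrete, here is a minimal sketch of the aggregation described above. It is not the code in this PR: the function names `daily_freqdist` and `article_mentions` are hypothetical, and it assumes each article's freqdist is available as a plain token-to-count mapping.

```python
from collections import Counter


def daily_freqdist(article_freqdists):
    """Sum per-article token counts into a single freqdist for the day."""
    total = Counter()
    for fd in article_freqdists:
        # Counter.update adds counts when given a mapping.
        total.update(fd)
    return total


def article_mentions(article_freqdists):
    """Count how many articles mention each token at least once
    (the dashboard-style metric, shown here only for comparison)."""
    mentions = Counter()
    for fd in article_freqdists:
        mentions.update(token for token, count in fd.items() if count > 0)
    return mentions


# Made-up sample data: two articles published on the same day.
articles = [
    {"dengue": 3, "rio": 1},
    {"dengue": 1, "vacina": 2},
]

hot = daily_freqdist(articles)
print(hot.most_common(3))
print(article_mentions(articles))
```

With this sample, "dengue" is printed 4 times in total but is mentioned by 2 articles, which is the difference between the two metrics.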

It's not done yet, but I'd appreciate some input. The idea is to create a collection with these global freqdists in Mongo so that we can easily extract time series for analyses.
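As a rough illustration of the time-series extraction, the sketch below works on plain Python dicts standing in for documents of the proposed collection. The document shape (`date` and `freqdist` fields) and the helper name `token_time_series` are assumptions for illustration only; the actual schema has not been decided in this PR.

```python
from datetime import date


def token_time_series(daily_docs, token):
    """Return (day, count) pairs for one token, ordered by day.

    Days on which the token does not appear get a count of 0.
    """
    ordered = sorted(daily_docs, key=lambda doc: doc["date"])
    return [(doc["date"], doc["freqdist"].get(token, 0)) for doc in ordered]


# Made-up daily documents, one per day, in arbitrary order.
docs = [
    {"date": date(2015, 3, 2), "freqdist": {"dengue": 5}},
    {"date": date(2015, 3, 1), "freqdist": {"dengue": 2, "rio": 1}},
    {"date": date(2015, 3, 3), "freqdist": {"rio": 4}},
]

series = token_time_series(docs, "dengue")
for day, count in series:
    print(day.isoformat(), count)
```

Storing one document per day keeps the query for a single token cheap, since only that token's count needs to be read from each daily document.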

==== THIS REQUIRES THAT THE ARTICLE COLLECTION HAS BEEN SENT TO PYPLN ====

Added script to extract freqdist from index.
Added script to extract freqdist from index.
Added script to extract freqdist from index.
modified generate_freqdist to optionally return the freqdist as a dataframe
…loud_backend into feature/hot-topic-of-the-day