Skip to content

matteo-massa/CAPSTONEDAPT0325PT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

7 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏦 Sentiment Analysis


πŸ“ Repository Structure

CAPSTONEDAPT0325PT/
β”‚
β”œβ”€β”€ CapstonePt Scraping .ipynb     # Google Play Store Scraping
β”œβ”€β”€ progetto capstone P2.ipynb     # NLP with BERT + CUDA Β· spaCy Lemmatization Β· Counter Frequencies
β”œβ”€β”€ CAPSTONE SQL.sql               # All MySQL VIEWs of the Data Warehouse
β”œβ”€β”€ capstone matteo.pbix           # Power BI Dashboard β€” 4 executive pages
└── README.md

▢️ How to Replicate the Project

Requirements

pip install google-play-scraper pandas sqlalchemy pymysql
pip install transformers torch tqdm
pip install spacy
python -m spacy download it_core_news_sm

MySQL Connection Configuration

sqlalchemy_url = f'mysql+pymysql://{USER}:{PASSWORD}@{HOST}/{DATABASE}'

Execution Order

# 1. Google Play Store Scraping
python python/01_scraping.py

# 2. Sentiment classification with BERT
python python/02_sentiment_bert.py

# 3. Lemmatization with spaCy
python python/03_lemmatizzazione_spacy.py

# 4. Top 20 frequencies extraction
python python/04_frequenze_counter.py

# 5. Create MySQL VIEWs in order:
#    view_classifica_valutazione
#    β†’ view_dettaglio_recensioni_def
#    β†’ view_disservizi_frequenti_def
#    β†’ view_disservizi_frequenti_grezzo
#    β†’ view_trend_temporale

# 6. Open capstone_matteo.pbix in Power BI Desktop
#    Update the MySQL connection string in Transform Data

πŸŽ“ Final Capstone

Project created as the final Capstone for the Data Analytics PT course β€” Epicode (March 2025).


For questions about the architecture or to chat about the project, contact me on LinkedIn.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors