CAPSTONEDAPT0325PT/
β
βββ CapstonePt Scraping .ipynb # Google Play Store Scraping
βββ progetto capstone P2.ipynb # NLP with BERT + CUDA Β· spaCy Lemmatization Β· Counter Frequencies
βββ CAPSTONE SQL.sql # All MySQL VIEWs of the Data Warehouse
βββ capstone matteo.pbix # Power BI Dashboard β 4 executive pages
βββ README.md
pip install google-play-scraper pandas sqlalchemy pymysql
pip install transformers torch tqdm
pip install spacy
python -m spacy download it_core_news_smsqlalchemy_url = f'mysql+pymysql://{USER}:{PASSWORD}@{HOST}/{DATABASE}'# 1. Google Play Store Scraping
python python/01_scraping.py
# 2. Sentiment classification with BERT
python python/02_sentiment_bert.py
# 3. Lemmatization with spaCy
python python/03_lemmatizzazione_spacy.py
# 4. Top 20 frequencies extraction
python python/04_frequenze_counter.py
# 5. Create MySQL VIEWs in order:
# view_classifica_valutazione
# β view_dettaglio_recensioni_def
# β view_disservizi_frequenti_def
# β view_disservizi_frequenti_grezzo
# β view_trend_temporale
# 6. Open capstone_matteo.pbix in Power BI Desktop
# Update the MySQL connection string in Transform DataProject created as the final Capstone for the Data Analytics PT course β Epicode (March 2025).
For questions about the architecture or to chat about the project, contact me on LinkedIn.