Amsterdam University College -- Text Mining -- Winter/Spring 2021.
You can use the Hello World notebooks to check that everything is working.
| Week | Topic | Materials |
|---|---|---|
| 1 | Introduction and Python refresher | slides + notebooks 1, 2, 3, 4, 5 |
| 2 | Introduction to NLP and NLP pipelines | slides + notebook |
| 3 | Language modelling | slides + notebooks 1, 2 |
| 4 | Vector space semantics | slides + notebook |
| 5 | Word embeddings | slides + notebook |
| 6 | Machine learning fundamentals | slides + notebook |
| 7 | Text classification | slides + notebook (Scikit-learn), notebook (PyTorch) |
| 8 | RNNs and NER | slides + notebook |
| 9 | Web scraping and APIs | notebook |
| 10 | Recommender systems | slides + notebook |
| 11 | Creating annotated corpora and sentiment analysis | slides + notebook |
| 12 | Clustering and topic modelling | slides + notebook |
| 13 | XAI and Ethics | Selected contents from this course |
- Introduction (Stanford's CS231N).
- Optimization 1 (Stanford's CS231N).
- Yes you should understand backprop by Andrej Karpathy.
- Optimization 2 (Stanford's CS231N).
See the projects folder for info.
- Tim Holthuijsen, Sergio Kazatzidis, Emilia Chammas, NLPChef - An NLP model for cooking recipe creation
- Lisa van Gelderen, Alexia Muresan, Zoë Prins, Victor van der Sman, Gender Bias in Fairy Tales
- Sarah de Jong, Tom Klein Tijssink, Lukas Busch, Exploring Different Machine Learning Approaches To Generate Song Lyrics
- Yiyang Cheng, Amaan Syed, Tori Baral, Explaining the qualitative differences between different classifiers on cyberbullying datasets
- Ilai Bachrach, Marc Oliveau, Lyric-based recommendation for music playlists
- Berke Filiz, Yuval Goren, AnneLouise de Boer, Recipe Generator
- Clone the repository locally:
git clone https://github.com/Giovanni1085/AUC_TMCI_2021.git - Get updates (from time to time):
git pull - Create a conda environemnt:
conda create -n myenv python=3.7 anaconda(wheremyenvis the envirnoment name) - Activate it:
conda activate myenv - Install packages (see the
requirements.txtfile), e.g.conda install pandas - Launch a Jupyter notebook:
jupyter notebook
- More on conda enviroments
- Conda cheatsheet
- Getting started with Jupyter notebooks
- On using git and GitHub for version control
Alternatively, use Binder (link above).
A more detailed guide to setup your environment, with multiple options.
- The previous-year edition of this course.
- Michael Repplinger, who ran the 2018/19 edition and Gianluca Lebani, who ran the 2017/18 edition.
- Giovanni Colavizza and Matteo Romanello, Applied Data Analysis course for the Oxford Digitial Humanities Summer School
- James Hetherington and Giovanni Colavizza, Research Software Engineering with Python
Everything in this repository which is not already attributed to someone else is released under CC BY 4.0.