GitHub - camilleAmaury/MachineLearningMaster1DataScience: Repository of exercices done during the french master degree of datascience in the Nantes University.

Name	Name	Last commit message	Last commit date
Latest commit History 11 Commits
DataMining	DataMining
ProbabilisticModel	ProbabilisticModel
TP1	TP1
TP2	TP2
TP3	TP3
images	images
README.md	README.md

Machine Learning Master 1 DataScience Exercices

Clément Antheaume - Camille-Amaury Juge

About the Project
- Built With
Content
License
Contact

About The Project

Here is a basic repository of all the directed and praticed missions we did during the first master degree year of machine learning in Nantes' University.

Built With

Here is a list of the Python library we used :

Content

TP1 : Naive Bayes, text preprocessing

Dataset : Users opinion (commentary and rate from 1 to 5) related to a course, see the csv file "/TP1/reviews_by_course.csv"

Aim :

Open a CSV file from pandas.
Quickly explore data with basic pandas function.
Learn how to process words using two approach (bag of words (hot encoder), TF-IDF) using Sklearn.
Learn how to implements a Naive Bayes model (Multinomial) in order to predict classification (the rate of the user) using Sklearn.
Learn to adjust model parameters.
Learn how to calculate and read results indicators.
Learn how to draw line-chart, standard deviation-chart using Plotly.

--> TF-IDF maybe need to be corrected (be careful about results)

--> need to play with more parameters than alpha

TP2 : Linear Regression, Gradient Boosting Regression

Dataset :

Multivalued indicators related to candidate chances of admission, see the csv file "/TP2/admissiondata.csv"
Multivalued indicators related to house's environnement with their price, see Load Boston

Aim :

Work with google colab, basic import functions and directory management.
Open a CSV file from pandas.
Quickly explore data with basic pandas function.
Pre-processing datas with pandas, numpy.
Learn how to understand Correlation Matrix (you can see an other repository nammed Data Analysis Project for more detailled statistics).
Learn how to implements a Linear Regression using Sklearn.
Learn how to calculate and read results indicators.
Learn how to implements a Gradient Boosting Regression using Sklearn.
Learn how to calculate and read results indicators.
Learn how to draw heathmap, scatterplot using Seaborn.
Learn how to draw barplot using Matplotlib.

TP3 : Decision Tree

Dataset :

Multivalued indicators related to people's rating on movies, see the csv file "/TP3/movies.csv"

Aim :

Work with google colab, basic import functions and directory management.
Open a CSV file from pandas.
Quickly explore data with basic pandas function.
Pre-processing datas with pandas, numpy.
Learn how to understand Confusion Matrix.
Learn how to implements a Decision Tree using Sklearn.
Learn how to calculate and read results indicators.

DataMining - TP2 : Basic process of the machine learning

Dataset :

Multivalued indicators related to people's bank crediting, see the csv file "/DataMining/TP2/german.data"

Aim :

Work with google colab, basic import functions and directory management.
Open a CSV file from pandas.
Quickly explore data with basic pandas function.
Pre-processing datas with pandas, numpy.
Learn how to understand Confusion Matrix.
Learn how to implements a Decision Tree, Random Forest, KNN using Sklearn.
Learn how to implements a Grid Search in order to do fine-tuning.
Learn how to implements a Cross-Validation rather than simple validation.
Learn how to calculate and read results indicators.

Probabilist Model - Backward, Forward :

Dataset :

see the file "/ProbabilisticModel/PM_Vitterbi_Forward_Backward.html"

Aim :

Implements Backward and Forward algorithm with python for PFA.

License

You can freely copy notebooks except for correction ones and pdf files which belongs to the Nantes' University.

Contact

See the linkedIn shields in the top of the README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Master 1 DataScience Exercices

Table of Contents

About The Project

Built With

Content

TP1 : Naive Bayes, text preprocessing

TP2 : Linear Regression, Gradient Boosting Regression

TP3 : Decision Tree

DataMining - TP2 : Basic process of the machine learning

Probabilist Model - Backward, Forward :

License

Contact

About

Uh oh!

Releases

Packages

Languages

camilleAmaury/MachineLearningMaster1DataScience

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Master 1 DataScience Exercices

Table of Contents

About The Project

Built With

Content

TP1 : Naive Bayes, text preprocessing

TP2 : Linear Regression, Gradient Boosting Regression

TP3 : Decision Tree

DataMining - TP2 : Basic process of the machine learning

Probabilist Model - Backward, Forward :

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages