Skip to content

sgsourav/crashds

Repository files navigation

CrashDS

Welcome to the World of Data Science with Python!

This is a crash course in Data Science, with Jupyter Notebook as the User Interface. I hope you'll like the experience. -- Sourav Sen Gupta


Python has two flavors -- Python 2 and Python 3. This set of examples are in Python 3, written and executed in the beautifully simple IDE Jupyter Notebook. Note that Jupyter has set up a localhost:8888 server to render the notebook in your computer's browser. It can render anything thereafter -- should be fun! Once you are familiar with the Jupyter Notebook environment, and the basic Python syntax in Module 0, feel free to explore the rest of the Modules on Data Science.

Module 0 : Python in Jupyter
Module 1 : Data Exploration
Module 2 : Linear Regression
Module 3 : Classification Tree
Module 4 : Bias Variance Trade-Off
Module 5 : Graphics in Plotly


To get the most out of this crash course, you'll need to download and use the Jupyter Notebooks for individual Modules posted in this repository (or git clone the repository). Before that, prepare your computer by installing the following.

Platform : Anaconda for Python 3.8 (https://www.anaconda.com/) Core Engine : Python 3.8 (it will get installed with Anaconda)
User Interface (IDE) : Jupyter Notebook (comes with Anaconda)

In case installing Anaconda for Python 3.8 is too heavy for you, and you know how to manage conda environments and packages, you may also download and install the MiniConda (https://docs.conda.io/en/latest/miniconda.html) package for Python 3.8. It's lightweight, but you will have to install all required Data Science packages individually, on your own.


This material is heavilly inspired by two wonderful books on Data Science, as follows -- one theoretical and one practical. I highly encourage anyone interested in hands-on Data Science to read these two books thoroughly to get a wholesome overview of the subject.

Highly recommended : Python 3 Cheat Sheet by Laurent Pointal. This 2-pager is a life-saver -- print and frame it! This material also borrows some ideas from the Python Notebooks for ISLR. You are encouraged to check it out while you read the book ISLR.


License Declaration : Following the lead from the inspirations for this material, and the spirit of Python education and development, all modules of this work are licensed under the Creative Commons Attribution 3.0 Unported License.

About

Crash Course on Data Science in Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published