39 changes: 34 additions & 5 deletions 1.0 Data Loading.ipynb
@@ -181,8 +181,7 @@
"\n",
"- each column of data is separated by a comma (or other equivalent and specified separator);\n",
"- the first row of the file may correspond to column headers;\n",
- "- the first column of each row may correspond to values of the `row index`.\n",
- "```"
+ "- the first column of each row may correspond to values of the `row index`."
]
},
{
@@ -1315,7 +1314,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
- "**!** The date parse is US datew friendly! *MM/DD/YYYY*\n",
+ "**!** The date parse is US date friendly! *MM/DD/YYYY*\n",
"\n",
"To use the more common international format for sure,<br>\n",
"add \n",
@@ -1329,7 +1328,7 @@
"The CSV import may be highly customized, <br>e.g.:\n",
"\n",
"* `date_parser` - which columns to parse.\n",
- "* `compression` - `pandas` hint compression of file, default: `infer`- auto discovery\n",
+ "* `compression` - `pandas` hint compression of file, default: `infer` - auto discovery\n",
"* `delimiter` - delimiter\n",
"* `thousands`, `decimal` - thousands or decimal character\n",
"* `encoding` - encoding of the file\n",
@@ -3109,6 +3108,7 @@
}
],
"metadata": {
+ "hide_input": false,
"kernelspec": {
"display_name": "Python 3",
"language": "python",
@@ -3124,7 +3124,36 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
- "version": "3.6.5"
+ "version": "3.5.3"
+ },
+ "varInspector": {
+ "cols": {
+ "lenName": 16,
+ "lenType": 16,
+ "lenVar": 40
+ },
+ "kernels_config": {
+ "python": {
+ "delete_cmd_postfix": "",
+ "delete_cmd_prefix": "del ",
+ "library": "var_list.py",
+ "varRefreshCmd": "print(var_dic_list())"
+ },
+ "r": {
+ "delete_cmd_postfix": ") ",
+ "delete_cmd_prefix": "rm(",
+ "library": "var_list.r",
+ "varRefreshCmd": "cat(var_dic_list()) "
+ }
+ },
+ "types_to_exclude": [
+ "module",
+ "function",
+ "builtin_function_or_method",
+ "instance",
+ "_Feature"
+ ],
+ "window_display": false
}
},
"nbformat": 4,
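Review note on the CSV import options touched in `1.0 Data Loading.ipynb`: a minimal sketch of how a few of the listed `read_csv` options combine, assuming a small made-up sample rather than the notebook's own data. The diff elides which option the notebook recommends for international dates; `dayfirst=True` is used here only as one pandas option that has that effect, and the column names are hypothetical.

```python
import io

import pandas as pd

# Hypothetical sample (not from the notebook): semicolon-delimited,
# day-first dates, '.' as thousands separator and ',' as decimal mark.
raw = "date;amount\n03/04/2018;1.234,5\n25/12/2018;10,25\n"

df = pd.read_csv(
    io.StringIO(raw),
    delimiter=";",         # column separator
    thousands=".",         # thousands character
    decimal=",",           # decimal character
    parse_dates=["date"],  # columns to parse as dates
    dayfirst=True,         # assumption: read 03/04/2018 as 3 April, not 4 March
)

print(df.dtypes)
print(df)
```

Without `dayfirst=True`, pandas would read an ambiguous value such as `03/04/2018` month-first, which is the US-friendly behaviour the notebook warns about.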
36 changes: 33 additions & 3 deletions 1.3 Data Wrangling.ipynb
@@ -21,7 +21,8 @@
"metadata": {},
"outputs": [],
"source": [
- "import pandas as pd"
+ "import pandas as pd\n",
+ "import os"
]
},
{
@@ -30,7 +31,6 @@
"metadata": {},
"outputs": [],
"source": [
- "import os\n",
"customer_churn_dataset = os.path.join(os.path.abspath(os.path.curdir), 'data', 'customer-churn-model', 'Customer Churn Model.txt')"
]
},
@@ -4658,6 +4658,7 @@
}
],
"metadata": {
+ "hide_input": false,
"kernelspec": {
"display_name": "Python 3",
"language": "python",
@@ -4673,7 +4674,36 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
- "version": "3.6.5"
+ "version": "3.5.3"
+ },
+ "varInspector": {
+ "cols": {
+ "lenName": 16,
+ "lenType": 16,
+ "lenVar": 40
+ },
+ "kernels_config": {
+ "python": {
+ "delete_cmd_postfix": "",
+ "delete_cmd_prefix": "del ",
+ "library": "var_list.py",
+ "varRefreshCmd": "print(var_dic_list())"
+ },
+ "r": {
+ "delete_cmd_postfix": ") ",
+ "delete_cmd_prefix": "rm(",
+ "library": "var_list.r",
+ "varRefreshCmd": "cat(var_dic_list()) "
+ }
+ },
+ "types_to_exclude": [
+ "module",
+ "function",
+ "builtin_function_or_method",
+ "instance",
+ "_Feature"
+ ],
+ "window_display": false
}
},
"nbformat": 4,
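Review note on the import move in `1.3 Data Wrangling.ipynb`: with `import os` now in the first cell, the path cell depends on that cell having run. A minimal sketch of what the path expression builds, assuming the notebook is started from the repository root; `base_dir` is a name introduced here for illustration only.

```python
import os

# Absolute path of the current working directory (where the kernel was started).
base_dir = os.path.abspath(os.path.curdir)

# Same components as in the notebook cell, joined with the platform's separator.
customer_churn_dataset = os.path.join(
    base_dir, 'data', 'customer-churn-model', 'Customer Churn Model.txt'
)

print(customer_churn_dataset)
# Whether the file is actually there depends on the working directory.
print(os.path.exists(customer_churn_dataset))
```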
34 changes: 32 additions & 2 deletions 1.4 Data Grouping and Filtering.ipynb
@@ -7,7 +7,7 @@
"# Data Grouping and Filtering\n",
"\n",
"Another quite popular operations to perform on data are **grouping** and **filtering**.\n",
- "The former is used to group values into mulitple subgroups and operate on specific groups; the latter is to dwelve into data based on specific categories or filters on values.\n",
+ "The former is used to group values into mulitple subgroups and operate on specific groups; the latter is to delve into data based on specific categories or filters on values.\n",
"Most of the time, these two operations are performed together.\n",
"\n",
"Pandas allows to do group and filtering, thanks to the `groupby` and `filter` functions. \n",
@@ -4230,6 +4230,7 @@
}
],
"metadata": {
+ "hide_input": false,
"kernelspec": {
"display_name": "Python 3",
"language": "python",
@@ -4245,7 +4246,36 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
- "version": "3.6.5"
+ "version": "3.5.3"
+ },
+ "varInspector": {
+ "cols": {
+ "lenName": 16,
+ "lenType": 16,
+ "lenVar": 40
+ },
+ "kernels_config": {
+ "python": {
+ "delete_cmd_postfix": "",
+ "delete_cmd_prefix": "del ",
+ "library": "var_list.py",
+ "varRefreshCmd": "print(var_dic_list())"
+ },
+ "r": {
+ "delete_cmd_postfix": ") ",
+ "delete_cmd_prefix": "rm(",
+ "library": "var_list.r",
+ "varRefreshCmd": "cat(var_dic_list()) "
+ }
+ },
+ "types_to_exclude": [
+ "module",
+ "function",
+ "builtin_function_or_method",
+ "instance",
+ "_Feature"
+ ],
+ "window_display": false
}
},
"nbformat": 4,
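Review note on the intro text of `1.4 Data Grouping and Filtering.ipynb`: a minimal sketch of the two operations it names, assuming a tiny made-up frame rather than the notebook's dataset. The column names `state` and `calls` are hypothetical, and `filter` is taken here to mean the group-wise `GroupBy.filter`.

```python
import pandas as pd

# Hypothetical data, not taken from the notebook.
df = pd.DataFrame({
    "state": ["NY", "NY", "CA", "CA", "TX"],
    "calls": [3, 5, 2, 8, 1],
})

# Grouping: split rows by state, then aggregate each subgroup.
mean_calls = df.groupby("state")["calls"].mean()

# Filtering: keep only the rows of groups whose total calls exceed 5.
busy = df.groupby("state").filter(lambda g: g["calls"].sum() > 5)

print(mean_calls)
print(busy)
```

The filter function is evaluated once per group and returns a single boolean, so whole groups are kept or dropped together.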