A set of generic templates for data preprocessing, exploratory analysis, text analysis and data visualisation. This repository is designed to demonstrate proficiency in key data analytics techniques and tools.
This repository is a curated collection of Jupyter Notebooks that serve as templates for various tasks in the data analytics process. The purpose of this repository is to:
- Showcase proficiency with Python for data-related tasks.
- Serve as a resource for common workflows in data analysis.
- Illustrate best practices in data preprocessing, EDA, and visualisation.
It is aimed at prospective employers and anyone interested in data analytics.
Exploratory Analysis
- Correlation Analysis
- Distribution Tests
- Similarity Analysis
Preprocessing
- Sampling Methods
- Aggregation
- Binarisation
- Handling Duplicates
- Extracting Nominal Categories
- Handling Missing Values
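A sketch of two of the preprocessing steps above, handling duplicates and missing values, using pandas; the column names and data are invented for illustration:

```python
import pandas as pd

# Toy dataset containing one duplicate row and one missing value.
df = pd.DataFrame({
    "id": [1, 2, 2, 3],
    "value": [10.0, 20.0, 20.0, None],
})

df = df.drop_duplicates()  # remove the repeated row
df["value"] = df["value"].fillna(df["value"].mean())  # impute missing value with the column mean
print(df)
```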
Text Analysis
- Case Folding
- Normalisation
- Stemming
- Stop Word Removal
- Tokenisation
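The text-analysis steps above can be sketched with the standard library alone; the stop-word list is a tiny illustrative set, and the suffix stripping is a naive stand-in for a real stemmer such as NLTK's PorterStemmer:

```python
import re

STOP_WORDS = {"the", "a", "is", "are", "of", "and"}  # tiny illustrative stop-word list

def preprocess(text: str) -> list[str]:
    text = text.casefold()                # case folding
    tokens = re.findall(r"[a-z]+", text)  # simple regex tokenisation
    tokens = [t for t in tokens if t not in STOP_WORDS]  # stop word removal
    # Naive suffix stripping, standing in for a proper stemmer.
    return [re.sub(r"(ing|ed|s)$", "", t) for t in tokens]

print(preprocess("The cats and the dog barked"))  # → ['cat', 'dog', 'bark']
```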
Visualisation
- Heatmaps
- Histograms
- Scatterplots
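A minimal matplotlib histogram as an example of the plot types above (the notebooks also use seaborn); the data, labels, and output filename are invented for illustration:

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so this runs headless
import matplotlib.pyplot as plt

# Illustrative data for a simple histogram.
values = [1, 2, 2, 3, 3, 3, 4, 4, 5]

fig, ax = plt.subplots()
ax.hist(values, bins=5, edgecolor="black")
ax.set_xlabel("value")
ax.set_ylabel("count")
fig.savefig("histogram.png")  # write the plot to disk instead of opening a window
```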
Requires Python 3.11+ and Jupyter Notebook.
pip install pandas scipy scikit-learn ydata-profiling nltk seaborn matplotlib
(`collections` and `re` are part of the Python standard library and need no installation.)
- Clone the repository:
git clone https://github.com/spencerduberry/Data-Analytics-Tools_Python.git
- Navigate to the relevant folder based on your task (e.g. preprocessing).
- Run the scripts or notebooks:
jupyter notebook Aggregation.ipynb
Contributions are welcome! If you have a useful template or improvement, feel free to open a pull request. Steps to contribute:
- Fork the repository.
- Create a branch (git checkout -b feature/NewFeature).
- Commit your changes (git commit -m 'Add new feature').
- Push the branch (git push origin feature/NewFeature).
- Open a pull request.
Spencer Duberry
LinkedIn: www.linkedin.com/in/spencer-duberry-938233285
Email: spencerduberry@hotmail.co.uk