MachineLearning

Machine learning in Python, from scratch. Methods: Linear Regression, Logistic Regression, Neural Network, K-Means, PCA, SVM

This project showcases my understanding of machine learning methods I have learned through classes or on my own.

Current methods used:

- Basic Linear Regression
- Logistic Regression (basic classification)
- Neural Network (advanced classification)
- K-Means (unsupervised clustering)
- PCA (dimension reduction)
- SVM (support vector machine)

Some of these methods minimize their cost function with scipy.optimize, while others use basic gradient descent.
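To illustrate the two optimization routes, here is a minimal sketch that fits a regularized logistic regression both ways. It is not the code from this repository: the function names (`cost_and_grad`, `fit_gradient_descent`, `fit_scipy`), the hyperparameters, and the synthetic data are all assumptions made for the example.

```python
import numpy as np
from scipy.optimize import minimize


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def cost_and_grad(theta, X, y, lam=1.0):
    """Regularized logistic-regression cost and gradient (bias term not regularized)."""
    m = len(y)
    h = sigmoid(X @ theta)
    h_safe = np.clip(h, 1e-12, 1 - 1e-12)  # keep log() finite
    reg = theta.copy()
    reg[0] = 0.0
    cost = -(y @ np.log(h_safe) + (1 - y) @ np.log(1 - h_safe)) / m
    cost += lam * (reg @ reg) / (2 * m)
    grad = X.T @ (h - y) / m + lam * reg / m
    return cost, grad


def fit_gradient_descent(X, y, alpha=0.5, iters=5000, lam=1.0):
    """Plain batch gradient descent with a fixed learning rate (assumed values)."""
    theta = np.zeros(X.shape[1])
    for _ in range(iters):
        _, grad = cost_and_grad(theta, X, y, lam)
        theta -= alpha * grad
    return theta


def fit_scipy(X, y, lam=1.0):
    """Same cost, minimized by scipy.optimize.minimize; jac=True means fun returns (cost, grad)."""
    start = np.zeros(X.shape[1])
    result = minimize(cost_and_grad, start, args=(X, y, lam), jac=True, method="BFGS")
    return result.x


# Synthetic, noisy two-feature data (an assumption; the project uses the Kaggle data set).
rng = np.random.default_rng(0)
features = rng.normal(size=(200, 2))
noise = rng.normal(scale=0.5, size=200)
labels = (features[:, 0] + features[:, 1] + noise > 0).astype(float)
X = np.column_stack([np.ones(len(labels)), features])  # prepend bias column

theta_gd = fit_gradient_descent(X, labels)
theta_opt = fit_scipy(X, labels)
accuracy = np.mean((sigmoid(X @ theta_opt) >= 0.5) == labels)
```

Both routes minimize the same convex cost, so they should land on nearly the same parameters. The scipy route usually needs far fewer cost evaluations, while gradient descent is easier to follow step by step.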

The current data set is breast cancer data from Kaggle, which can be found here: https://www.kaggle.com/uciml/breast-cancer-wisconsin-data

Current functionality:

- Neural Network:
  - Should work with any numerical data; the network structure is fit to the input data and output.
  - Automatically converts class labels to vectorized output.
  - Forward propagation and backpropagation for learning.
  - Regularization.
  - Number of hidden layers and nodes is easy to change.
  - Takes learning parameters as arguments (easy to loop over for optimization).

- K-Means:
  - Separates data into K groupings.
  - Plots groupings using reduced features.
  - Plots movement of centroids over iterations.

- SVM:
  - Separates data based on the boundary with maximized separation.

- PCA:
  - Reduces dimensions so that 99% of variance is retained.
  - Finds the unreduced features that contribute most to separation after reduction.
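As an illustration of the PCA behavior listed above, the sketch below keeps the smallest number of principal components whose cumulative variance reaches 99%. It is one possible way to do this with NumPy's SVD, not necessarily how the repository does it; `pca_reduce`, the synthetic data, and the loading-based feature ranking are assumptions made for the example.

```python
import numpy as np


def pca_reduce(X, retain=0.99):
    """Project X onto the fewest principal components that keep `retain` of the variance."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant columns unscaled
    Xn = (X - mean) / std
    # Squared singular values are proportional to the variance along each component.
    _, S, Vt = np.linalg.svd(Xn, full_matrices=False)
    explained = np.cumsum(S**2) / np.sum(S**2)
    k = int(np.searchsorted(explained, retain)) + 1  # first k reaching the target
    components = Vt[:k]
    return Xn @ components.T, components, explained[k - 1]


# Synthetic data: 10 observed features driven by 3 latent factors plus small noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(300, 3))
mixing = rng.normal(size=(3, 10))
data = latent @ mixing + 0.01 * rng.normal(size=(300, 10))

Z, components, retained = pca_reduce(data)

# Original features with the largest absolute weight in the first component.
# This is one simple reading of "features that contribute highly".
top_features = np.argsort(-np.abs(components[0]))[:3]
```

Here `Z` holds the reduced data, `retained` is the fraction of variance actually kept, and `top_features` lists column indices of the original data ranked by their loading on the first component.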
