Skip to content

Generate data for different tasks (linear regression, logistic regression, etc.)

Notifications You must be signed in to change notification settings

fiquinho/data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data generator for machine learning models

This repository is meant to be a collection of scripts for generating different data sets that can be used to train machine learning models.

This data points are randomly generated and try to imitate real world data. They can be used to test how a model performs on different data distributions for the same task.

There is a Web UI in this project served with Streamlit. Check it out here.

Available generators

This module will create a data set of 2D data points that can be model using linear regression. They will be created by randomly deviating from a linear function. The linear function parameters, and the distribution of the deviations can be controlled by the script. It will generate a .csv file with the data, and it will plot the generated data. The linear function from where the data was generated can also be plotted.

The plotted data will look something like this:

linear_regression_data_example

About

Generate data for different tasks (linear regression, logistic regression, etc.)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages