Skip to content

thevindu02/Breast-Cancer-Classification-System

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Breast Cancer Classification using Logistic Regression

This project applies Logistic Regression to classify breast cancer tumors as Benign or Malignant using the Breast Cancer Wisconsin (Diagnostic) dataset from Kaggle.

The goal is to build a simple but effective predictive model for early detection of breast cancer, which is a critical step in assisting doctors with diagnosis.

πŸ“Š Dataset

  • Source: Breast Cancer Wisconsin (Diagnostic) Dataset - Kaggle
  • Features: 30 numerical features computed from digitized images of breast masses (radius, texture, smoothness, etc.)
  • Target:
    • 0 β†’ Malignant (cancerous)
    • 1 β†’ Benign (non-cancerous)
  • Samples: 569 total

πŸ› οΈ Tech Stack

  • Language: Python
  • Environment: Google Colab
  • Libraries Used:
    • numpy
    • pandas
    • matplotlib / seaborn (for visualization)
    • scikit-learn (for preprocessing, model training & evaluation)

πŸš€ Project Workflow

  1. Data Loading & Exploration

    • Import dataset, inspect features, check for missing values
    • Visualize class distribution
  2. Data Preprocessing

    • Train-test split
  3. Model Building

    • Logistic Regression model using Scikit-learn
  4. Model Evaluation

    • Accuracy Score

πŸ“ˆ Results

  • Accuracy Achieved: ~95% (depending on random split)
  • Logistic Regression proved effective in distinguishing between benign and malignant cases.

πŸ§‘β€πŸ’» Author

  • Thevindu Dilmith

About

Trained a machine learning model to classify breast cancer as 'benign' or 'malignant'

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published