Skip to content

dm464/CS301-final-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Classifying fetal health in order to prevent child and maternal mortality

Description

Reduction of child mortality is reflected in several of the United Nations' Sustainable Development Goals and is a key indicator of human progress. The UN expects that by 2030, countries end preventable deaths of newborns and children under 5 years of age, with all countries aiming to reduce under‑5 mortality to at least as low as 25 per 1,000 live births.

Parallel to notion of child mortality is of course maternal mortality, which accounts for 295 000 deaths during and following pregnancy and childbirth (as of 2017). The vast majority of these deaths (94%) occurred in low-resource settings, and most could have been prevented.

In light of what was mentioned above, Cardiotocograms (CTGs) are a simple and cost accessible option to assess fetal health, allowing healthcare professionals to take action in order to prevent child and maternal mortality. The equipment itself works by sending ultrasound pulses and reading its response, thus shedding light on fetal heart rate (FHR), fetal movements, uterine contractions and more.

About the Data

This dataset contains 2126 records of features extracted from Cardiotocogram exams, which were then classified by three expert obstetritians into 3 classes:

  • Normal
  • Suspect
  • Pathological

For this project, we are creating a model to classify the outcome of Cardiotocogram (CTG) exam (which represents the well being of the fetus).

Tasks

  1. Present a visual distribution of the 3 classes. Is the data balanced? How do you plan to circumvent the data imbalance problem, if there is one? (Stratification needs to be included)
  2. Present 10 features that are most reflective to fetal health conditions (there are more than one way of selecting features and any of these are acceptable) . Present if the correlation is statistically significant (using 95% and 90% critical values)
  3. Develop two different models to classify CTG features into the three fetal health states (Note that this is a multiclass problem that can also be treated as regression, since the labels are numeric.)
  4. Visually present the confusion matrix
  5. With a testing set of size of 30% of all available data, calculate
    • Area under the ROC Curve
    • F1 Score
    • Area under the Precision-Recall Curve
    • (for both models in 3)
  6. Without considering the class label attribute, use k-means clustering to cluster the records in different clusters and visualize them (use k to be 5, 10, 15).

What to submit?

a. all code

b. a document that adequately describes your approach for each of the questions.

c. visualizations and other results.

Team

  • Ranfis Fracisco
  • Denisse Mendoza

About

Classifying fetal health in order to prevent child and maternal mortality.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages