- Data driven decision making
- Questions -> Data -> Models/Tools
| Data type | Models/Tools | Applications |
|---|---|---|
| -EHR data -Insurance claims data |
ML(logistic regression,XGBoost) | Predict outcomes (disease, death, readmission etc.) |
| -Clinical notes -Conversation text data |
-Rule based approach(regular expression) -Deep learning apporach |
-Extract concepts from clinical notes -Knowledge graphs -Chat-bot -QA system |
| Medical image data (X-ray, CT, OCR image etc.) | CNN | -Detection: diagnosis of skin cancer lung nodule or diabetic reinopathy -Segmentation of tumor, histopathology |
| Time series data (EEG, ECG, vital sign data etc.) | HMM,RNN,CNN | -Heart disease -Sleep disorder(apnea) -ICU monitoring |
| Genomics data | GATK,QIIME | -Cancer mutation identification -Biomarker identification -Durg discovery |
| Other data (hospital operational data) | -ML(regression) -Queueing model |
-Reduce operational cost -Improve patient experience -ER wait time and queueing |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| Review | Mining electronic health records: towards better research applications and clinical care | 2012 | |||
| Review | Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review | 2016 | |||
| heart failure | -logistic regression -random forest |
longitudinal EHR data | 1684 heart failure cases and 13525 matched controls | Early Detection of Heart Failure Using Electronic Health Records | 2016 |
| heart failure (review) | Population Risk Prediction Models for Incident Heart Failure | 2015 | |||
| Kidney transplant graft failure | Cox regression | 10-years EHR data | 69,440 kidney transpants | A comprehensive risk quantification score for deceased donor kidneys: the kidney donor risk index | 2009 |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| Review | Realizing the full potential of electronic health records: the role of natural language processing | 2011 | |||
| Review | Natural language processing: an introduction | 2011 | |||
| Negation | Regular expression and rule-based approach | Clinical reports | 2060 discharge summaries | A simple algorithm for identifying negated findings and diseases in discharge summaries | 2001 |
| Using electronic health records to drive discovery in disease genomics | |||||
| NER | discharge summaries | 826 notes | A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries | 2011 |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| Diabetic retinopathy | CNN | retinal fundus images | 128175 retinal images | Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs | 2016 |
| Skin cancer | CNN | skin images | 129,450 skin images | Dermatologist-level classification of skin cancer with deep neural networks | 2017 |
| Tumor | CNN | Pathology images | 400+110 slides | Detecting Cancer Metastases on Gigapixel Pathology Images | 2017 |
| Deep Learning Automates the Quantitative Analysis of Individual Cells in Live-Cell Imaging Experiments |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| sinus rhythm and atrial fibrillation | 34-layer convolutional neural network (CNN) | single-lead ECG | -(Train) 64,121 ECG records from 29,163 patients -(Test) 336 records from 328 unique patients |
Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks | 2017 |
| Hand movements | CNN | sEMG | 67 intact subjects and 11 transradial amputees | Deep Learning with Convolutional Neural Networks Applied to Electromyography Data: A Resource for the Classification of Movements for Prosthetic Hands | 2016 |
| Review | ICU data | Machine Learning and Decision Support in Critical Care | 2017 |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| Genetic variants | Exome NGS | NGS&EHR data | 50,726 individuals | Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study | 2016 |
| Familial hypercholesterolemia | Exome NGS | NGS&EHR data | 50,726 individuals | Genetic identification of familial hypercholesterolemia within a single U.S. health care system | 2016 |
| Prediction outcomes | Models/Tools | Data type | Sample size | Reference | Year |
|---|---|---|---|---|---|
| Drug discovery | LSTM | Assay | 12-27 assays | Low data drug discovery with one-shot learning | 2017 |
| Tutorial | Image | Deep learning models for health care: challenges and solutions | 2017 | ||
| Tutorial | Image | Deep learning in radiology: recent advances, challenges and future trends | 2016 | ||
| Tutorial | Big data analytics for healthcare | 2013 | |||
| Tutorial | Image | Survey of deep learning in radiology | 2017 | ||
| ER wait time | ER visit time | Accurate ED Wait Time Prediction | 2017 |