Quality control assessment of LC-MS data from Omics dataset to ensure data quality .
This repository contains R scripts and data for performing quality control (QC) analyses on both omics and xenobiotics datasets. The analyses follow standard reproducibility metrics and WADA guidelines for chemical and biological data.
quality_control_for_data_quality/
├── QC_omics_statistical_analysis.Rmd
├──QC_xenobiotics_statistical_analysis.Rmd
├── input/
│ ├── formulas.csv
│ └── xenob_QC.csv
├── output/
├── quality_control_stats.Rproj
└── README.md
File: QC_omics_statistical_analysis.Rmd
This analysis performs quality control on LC-HRMS omics datasets.
- Load raw data: Import the LC-HRMS measurements.
- Clean data: Remove isotopologues, low-intensity features, and known contaminants.
- Chemical filtering: Apply H/C, N/C, O/C ratios and magnitude thresholds.
- Reproducibility metrics:
- Identify common vs. unique features across replicates.
- Calculate replicate intensity differences (RIdiff) and thresholds.
- Average replicates: Compute mean peak intensities for repeated molecular formulas.
- Visualizations:
- Average peak intensities per retention time
- RSD distribution across replicates
- Export results: Cleaned datasets, reproducibility summary, and plots are saved in the output/ folder.
R Packages Used: tidyverse, data.table, NLP, jsonlite
File: QC_xenobiotics_statistical_analysis.Rmd
This analysis performs quality control on xenobiotics datasets according to WADA guidelines.
- Load raw data: Import xenobiotics QC measurements.
- Calculate measured concentration: Compute mean ratios, standard deviation, and CV (%).
- Relative accuracy (% Bias): Compare blood and urine measurements against reference matrix.
- Precision:
- Intra-day: Variability within the same day.
- Inter-day: Variability across different days.
- System suitability: Assess retention time (RT) reproducibility and internal standard precision.
- Visualizations:
- Accuracy and CV across matrices
- Intra- and inter-day precision
- Export results: Processed datasets and plots are saved in the output/ folder. R Packages Used: tidyverse
Clone the repository:
git clone git@github.com:beccamatos/quality_control_for_data_quality.git
cd quality_control_for_data_quality
Open the project in RStudio using quality_control_stats.Rproj. Render the Markdown files to HTML:
rmarkdown::render("QC_omics_statistical_analysis.Rmd")
rmarkdown::render("QC_xenobiotics_statistical_analysis.Rmd")