This repository contains the analysis and plotting scripts for the HiChIP manuscript, designed to process and analyze data from TCGA enhancers and gene interactions.
Clone the repository:
git clone https://github.com/NCICCGPO/HiChIP-Manuscript.git
cd HiChIP-ManuscriptInstall the necessary dependencies for R and Python scripts. You may need R, Python 3.x, and the following libraries:
- R packages:
ggplot2,dplyr, etc. - Python packages:
matplotlib,numpy, etc.
For R:
install.packages(c("ggplot2", "dplyr", ...))For Python:
pip install matplotlib numpyThe repository contains multiple R and Python scripts, each performing different tasks related to HiChIP analysis. Below are the primary scripts and their usage:
-
TCGA Enhancer Analysis:
1_TCGA_Enhancers_Gene_Interactions.r: Analysis of enhancer-gene interactions using TCGA data.2_TCGA_Enhancer_Copy_Number_Regression.r: Performs regression analysis on enhancer copy number data.
Example:
Rscript 1_TCGA_Enhancers_Gene_Interactions.r
-
RNA Modeling:
3_TCGA_RNA_Modeling.r: RNA modeling analysis for TCGA data.4_TCGA_RNA_Modeling_Analysis.r: Detailed RNA modeling analysis.
Example:
Rscript 3_TCGA_RNA_Modeling.r
-
Visualization:
Plot_heatmaps.R: Plots heatmaps for various interaction data.plot_neoloop_dist.py: Python script for plotting neo-loop distributions.
Example:
python plot_neoloop_dist.py
For more detailed usage, see each script's comments and documentation.
-
Enhancer Analysis:
1_TCGA_Enhancers_Gene_Interactions.r2_TCGA_Enhancer_Copy_Number_Regression.rTCGA_King_Enhancer_Interaction_Functions.r
-
RNA Modeling:
3_TCGA_RNA_Modeling.r4_TCGA_RNA_Modeling_Analysis.rTCGA_King_RNA_modeling_functions.r
-
Visualization:
Plot_heatmaps.Rplot_neoloop.pyPlot_EIS.R
-
Workflow Scripts:
HiChIP_decomposition_workflow.R: Main workflow for HiChIP data decomposition.Non_coding_workflow_Final.R: Non-coding region workflow.
If you'd like to contribute, please fork the repository and use a feature branch. Pull requests are welcome.
This project is licensed under the MIT License - see the LICENSE file for details.
This project was developed with data from TCGA and contributions from multiple collaborators in the HiChIP community.