diff --git a/README.md b/README.md
index b6ce120..b4f99ae 100644
--- a/README.md
+++ b/README.md
@@ -26,9 +26,14 @@ The following steps outline how to test run DEPICT on LDL cholesterol GWAS summa
 
 1. Edit `DEPICT/example/ldl_teslovich_nature2010.cfg`
   * Point `plink_executable` to where PLINK executable (version 1.9 or higher) is on our system (e.g. `/usr/bin/plink`)
-2. Run DEPICT on the LDL summary statistics
+  * Point `analysis_path` to absolute path where output files will be written
+  * Point `gwas_summary_statistics_file` to absolute path to gwas summary statistics in the example directory. 
+2. Download the collections file. 
+  *  Download the collections file and place in `data/collections/ld0.5_collection_1000genomespilot_depict_150429.txt.gz` 
+  *  The files is located in https://data.broadinstitute.org/mpg/depict/depict_download/collections/ld0.5_collection_1000genomespilot_depict_150429.txt.gz
+3. Run DEPICT on the LDL summary statistics
   * E.g. `./src/python/depict.py example/ldl_teslovich_nature2010.cfg`
-3. Investigate the results (see the [Wiki](https://github.com/perslab/DEPICT/wiki) for a description of the output format).
+4. Investigate the results (see the [Wiki](https://github.com/perslab/DEPICT/wiki) for a description of the output format).
   * DEPICT loci `ldl_teslovich_nature2010_loci.txt`
   * Gene prioritization results `ldl_teslovich_nature2010_geneprioritization.txt`
   * Gene set enrichment results `ldl_teslovich_nature2010_genesetenrichment.txt`