Skip to content

Tools for predicting candidate genes at disease risk loci

Notifications You must be signed in to change notification settings

kjgaulton/gene-pred

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gene-pred
gene prediction scripts and data

# gene annotation mapping

stig
runs semantic mapping
requires: Perl PDL library, Lingua::Stem; R packages lsa and SnowballC
run: ./stig --q res/T2D.query.manual.txt --db data/gene-db/ --o output_file1
--q, query .txt file
--db, database of gene annotation files
--o, output

map.semantic
maps semantic output to genes at T2D loci
run: ./map.semantic res/T2D-loci.all.genes2 output_file1 > output_file1-2

# diabetes association mapping

run.pred
runs association data prediction 
run: ./run.pred --i data/T2D.loci --a data/T2D-loci.out.gz --o output_file2 --cds data/T2D-loci.CDS --dhs data/T2D-loci.DHS --dist data/T2D-loci.dist-50kb
--i, list of loci to run
--a, file of variant association data at each locus
--o, output
--cds, file of coding annotations
--dhs, file of DHS annotations
--dist, file of distance annotations

frmt.pred - format prediction output
fun: ./frmt.pred output_file2 > output_file2-2

# merge results
merge
run: ./merge output_file1-2 output_file2-2 > merged.output

# resources
Resource package can be found here
https://www.dropbox.com/sh/d1z9ota2uw807to/AADbJ_suBCDL7xXxbRzVnbCva?dl=0

Download and unpack the resources into this directory

About

Tools for predicting candidate genes at disease risk loci

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •