Feature #516

Add conservation track from ucsc

Added by Madeleine Ball almost 12 years ago. Updated over 11 years ago.

Assigned To:
Target version:
Start date:
Due date:
% Done:


Estimated time:
Story points:


Blosum100 is a simple measure of conservation, but is a better measure of how disruptive a particular amino acid change will be. We should also have the phyloP score, downloaded from ucsc, for the base in consideration, where available.

The phyloP score is the -log(p) measure of predicted conservation for the specific base (as opposed to phastCons which is a measure of conservation of the base in the context of flanking bases). Note that the measure also includes negative values, which indicate the base in question is predicted to be fast-evolving.

I've downloaded the broadest track, vertebrates (there are also mammal and primates), and written a very simple program (lookup.c) to pull out values given chr# and location. Interestingly, the lookup values agree qualitatively with the ucsc genome browser values, but they are not an exact match - perhaps the phyloP switches used for the ucsc ftp files were slightly different from switches used for the data in the genome browser. It might be worth re-running some of the data.

All can be found on:


Closing this ticket should include folding these data into the GET INSTALL scripts.

[2009, Pollard, Hubisz, Rosenbloom, et al, Gen Res]


#1 Updated by Madeleine Ball almost 12 years ago

Replying to [comment:1 https://www.google.com/accounts/o8/id?id=AItOawmjAqCfwHP2SRB4CAwzldFZzlop-n5GiCo]:
Is this a trac bug? Owner is krobasky (me)

#2 Updated by Ward Vandewege over 11 years ago

  • Project changed from External to GET-Evidence
  • Category deleted (GET-Evidence)

#3 Updated by Madeleine Ball over 11 years ago

  • Assigned To deleted (Jason Bobe)

Assignment to Jason Bobe was an error in the import process. It was assigned to Kim, but since that was a long time ago I'm going to reassign it to nobody.

Also available in: Atom PDF