Data Resources

From Evolutionary Informatics Working Group
Revision as of 15:30, 22 May 2008 by Arlin.stoltzfus@nist.gov (talk) (TreeBase)

Quick analysis of data resources to target for interop hackathon

The idea is that each of these resources has comparative data that we might want to make interoperable.

questions to ask for each resource

Phylogeny Services

Phylogeny.Fr

http://www.phylogeny.fr

Output formats are Newick, NHX, and Phylip, but there is apparently no means to export the alignment and the tree together.
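One possible workaround on the client side, sketched here rather than taken from Phylogeny.Fr itself, is to bundle a separately exported alignment and Newick tree into a single NEXUS file. The function name `to_nexus`, its parameters, and the sample taxa are hypothetical; the sketch assumes DNA data and taxon labels without whitespace or punctuation (otherwise they would need NEXUS quoting).

```python
def to_nexus(alignment, newick, tree_name="tree1", datatype="DNA"):
    """Combine an alignment and a Newick tree into one NEXUS string.

    alignment: dict mapping taxon label -> aligned sequence (equal lengths).
    newick:    Newick tree string whose leaf labels match the taxon labels.
    Assumption: labels contain no whitespace or NEXUS punctuation.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all sequences must have the same length")
    nchar = lengths.pop()
    width = max(len(name) for name in alignment)

    lines = [
        "#NEXUS",
        "",
        "BEGIN DATA;",
        f"  DIMENSIONS NTAX={len(alignment)} NCHAR={nchar};",
        f"  FORMAT DATATYPE={datatype} MISSING=? GAP=-;",
        "  MATRIX",
    ]
    # One row per taxon, labels padded so the sequences line up.
    for name, seq in alignment.items():
        lines.append(f"    {name.ljust(width)}  {seq}")
    # Normalize the tree so it ends with exactly one semicolon.
    tree = newick.strip().rstrip(";")
    lines += [
        "  ;",
        "END;",
        "",
        "BEGIN TREES;",
        f"  TREE {tree_name} = {tree};",
        "END;",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical toy input, for illustration only.
example = to_nexus({"A": "ACGT", "B": "ACGA", "C": "AC-T"}, "((A,B),C);")
```

The result keeps the characters and the tree in one file, which is the combination the service does not appear to offer directly.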

Data Resources

TreeBase

TreeBaseII has the kind of granular schema that would be a good challenge to accommodate using CDAO. What is missing from CDAO (and NeXML) is the notion of a "study" that carries various types of metadata, including publication/reference metadata (which is somewhat like Dublin Core).

  • TreeBase 1996 paper cited 21 times
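To make the missing "study" notion concrete, here is a minimal sketch of a study record with Dublin Core-style publication metadata. The class `Study`, its field names, and the `to_dc` method are hypothetical and are not drawn from the TreeBASE schema, CDAO, or NeXML; only the element names (title, creator, date, identifier, source) are standard Dublin Core elements.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Study:
    """Hypothetical container tying trees and matrices to one publication."""
    identifier: str                      # study accession (assumed form)
    title: str                           # title of the publication
    creators: List[str] = field(default_factory=list)    # authors
    date: str = ""                       # publication year or date
    source: str = ""                     # journal citation
    tree_ids: List[str] = field(default_factory=list)    # trees in the study
    matrix_ids: List[str] = field(default_factory=list)  # character matrices

    def to_dc(self) -> List[Tuple[str, str]]:
        """Return the publication metadata as (dc:element, value) pairs."""
        pairs = [("dc:identifier", self.identifier), ("dc:title", self.title)]
        pairs += [("dc:creator", name) for name in self.creators]
        if self.date:
            pairs.append(("dc:date", self.date))
        if self.source:
            pairs.append(("dc:source", self.source))
        return pairs


# Placeholder values, for illustration only.
study = Study(identifier="S1", title="Example study",
              creators=["A. Author", "B. Author"], tree_ids=["T1"])
```

Keeping the tree and matrix identifiers on the study, rather than the reverse, is just one design choice; the point is only that some object has to own the reference metadata.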

TreeFam

  • citations for 2006 paper: 36

Hovergen, hogenom

  • hovergen 1994 paper cited 157 times

Pandit

pPOD

pPOD (not really a data resource; it's a database technology project led by computer scientists)

organism-centered

Organism-centered gene-family databases, e.g., for Plasmodium