This repository creates user-friendly extensions to the Disease Ontology (DO) [1]. Simple TSV files are extracted from the OBO-formatted ontology, including datasets for term names, cross-references, and subsumption relationships. Additionally, a slim term set is extracted, which we use for our drug repurposing research.
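To illustrate the kind of OBO-to-TSV extraction described above, here is a minimal sketch in Python. It is not the code used in this repository's notebooks. The sample stanzas, the IDs in them, the function names `parse_obo_terms` and `xrefs_to_tsv`, and the TSV column names are all assumptions made for illustration. The sketch relies only on the general OBO layout of `[Term]` stanzas with `id:`, `name:`, `xref:`, and `is_a:` tags, and it ignores details such as trailing modifiers or quoted descriptions on xref lines.

```python
import csv
import io

# Illustrative OBO text; the IDs and xrefs are made up, not real DO content.
SAMPLE_OBO = """\
format-version: 1.2

[Term]
id: DOID:0000001
name: example disease
xref: EXAMPLE:001
is_a: DOID:0000000 ! root disease

[Term]
id: DOID:0000002
name: example disease subtype
xref: EXAMPLE:002
xref: OTHER:17
is_a: DOID:0000001 ! example disease

[Typedef]
id: part_of
name: part_of
"""


def parse_obo_terms(lines):
    """Yield one dict per [Term] stanza (hypothetical helper)."""
    term = None
    for raw in lines:
        line = raw.strip()
        if line.startswith("["):
            # A new stanza starts: emit the previous term, if any.
            if term is not None:
                yield term
            if line == "[Term]":
                term = {"id": None, "name": None, "xrefs": [], "parents": []}
            else:
                term = None  # skip non-term stanzas such as [Typedef]
            continue
        if term is None or ":" not in line:
            continue  # header lines, blank lines, or skipped stanzas
        key, _, value = line.partition(":")
        # Drop the trailing "! comment" that OBO allows after a value.
        value = value.split(" ! ")[0].strip()
        if key in ("id", "name"):
            term[key] = value
        elif key == "xref":
            term["xrefs"].append(value)
        elif key == "is_a":
            term["parents"].append(value)
    if term is not None:
        yield term


def xrefs_to_tsv(terms):
    """Return a TSV string with one row per (term, xref) pair."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(["doid", "name", "xref"])  # assumed column names
    for term in terms:
        for xref in term["xrefs"]:
            writer.writerow([term["id"], term["name"], xref])
    return buffer.getvalue()


terms = list(parse_obo_terms(SAMPLE_OBO.splitlines()))
print(xrefs_to_tsv(terms))
```

To run the same sketch on a real file, the lines of an opened file handle could be passed to `parse_obo_terms` instead of `SAMPLE_OBO.splitlines()`.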
DO-xrefs.ipynb extracts cross-references from download/HumanDO.obo and produces easy-to-read mapping files. data/xrefs-prop.tsv contains propagated cross-references, so that, for example, xrefs to relapsing remitting multiple sclerosis are propagated to multiple sclerosis.
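The propagation idea can be sketched as follows. This is a minimal illustration, not the notebook's actual implementation: each term passes its cross-references up to every term that subsumes it. The function names `ancestors` and `propagate_xrefs`, the short term labels, and the xref values are hypothetical.

```python
def ancestors(term, parents):
    """Return every term that subsumes `term`, following parent links."""
    seen = set()
    stack = [term]
    while stack:
        for parent in parents.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def propagate_xrefs(parents, xrefs):
    """Map each term to its own xrefs plus those of the terms it subsumes."""
    propagated = {}
    for term, refs in xrefs.items():
        # A term keeps its own xrefs and hands them to all of its ancestors.
        for node in {term} | ancestors(term, parents):
            propagated.setdefault(node, set()).update(refs)
    return propagated


# Hypothetical subsumption links (child -> parents) and made-up xrefs.
parents = {
    "relapsing remitting multiple sclerosis": ["multiple sclerosis"],
    "multiple sclerosis": ["demyelinating disease"],
}
xrefs = {
    "relapsing remitting multiple sclerosis": {"EXAMPLE:002"},
    "multiple sclerosis": {"EXAMPLE:001"},
}

result = propagate_xrefs(parents, xrefs)
for term in sorted(result):
    print(term, sorted(result[term]))
```

In this example, multiple sclerosis ends up with both xrefs, while the more specific subtype keeps only its own. Tracking visited terms in `seen` keeps the traversal from revisiting a term that is reachable through more than one parent.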
slim.ipynb reads DO Slim terms and generates slim-specific datasets.
IGS_scripts contains the scripts from the IGS/disease-ontology repo. These scripts were converted to Python 3, and a few conversion errors were fixed manually.
download contains a Subversion checkout of the master DO.
data contains created datasets which include:
- term-names.tsv: names including synonyms for DO terms
- xrefs.tsv: cross-references to external disease vocabularies
- xrefs-prop.tsv: cross-references where diseases inherit all cross-references of the diseases they subsume
- slim-terms.tsv: a (semi-manually created) slim term set referred to as DO Slim
- slim-terms-prop.tsv: all subsume relationships for DO Slim
- xrefs-slim.tsv: cross-references to external disease vocabularies for slim terms
- xrefs-prop-slim.tsv: cross-references for slim terms where diseases inherit all cross-references of the diseases they subsume
Disease Ontology content and derivatives are licensed under CC-BY 3.0. All original content is licensed under CC0 1.0.