dipper.sources.Bgee module¶
-
class
dipper.sources.Bgee.
Bgee
(graph_type, are_bnodes_skolemized, data_release_version=None, tax_ids=None, version=None)¶ Bases:
dipper.sources.Source.Source
Bgee is a database to retrieve and compare gene expression patterns between animal species.
Bgee first maps heterogeneous expression data (currently RNA-Seq, Affymetrix, in situ hybridization, and EST data) to anatomy and development of different species.
Then, in order to perform automated cross species comparisons, homology relationships across anatomies, and comparison criteria between developmental stages, are designed.
-
check_if_remote_is_newer
(localfile, remote_size, remote_modify)¶ Overrides check_if_remote_is_newer in Source class
Parameters: - localfile – str file path
- remote_size – str bytes
- remote_modify – str last modify date in the form 20160705042714
Returns: boolean True if remote file is newer else False
-
default_species
= ['Cavia porcellus', 'Mus musculus', 'Rattus norvegicus', 'Monodelphis domestica', 'Anolis carolinensis', 'Caenorhabditis elegans', 'Drosophila melanogaster', 'Danio rerio', 'Xenopus (Silurana) tropicalis', 'Gallus gallus', 'Ornithorhynchus anatinus', 'Erinaceus europaeus', 'Macaca mulatta', 'Gorilla gorilla', 'Pan paniscus', 'Pan troglodytes', 'Homo sapiens', 'Canis lupus familiaris', 'Felis catus', 'Equus caballus', 'Sus scrofa', 'Bos taurus', 'Oryctolagus cuniculus']¶
-
fetch
(is_dl_forced=False)¶ Parameters: is_dl_forced – boolean, force download Returns:
-
files
= {'anat_entity': {'columns': ['Ensembl gene ID', 'gene name', 'anatomical entity ID', 'anatomical entity name', 'rank score', 'XRefs to BTO'], 'path': '/download/ranks/anat_entity/', 'pattern': re.compile('^[0-9]+_anat_entity_all_data_.*.tsv.gz')}}¶
-
parse
(limit=None)¶ Given the input taxa, expects files in the raw directory with the name {tax_id}_anat_entity_all_data_Pan_troglodytes.tsv.zip
Parameters: limit – int Limit to top ranked anatomy associations per group Returns: None
-