dipper.sources.Bgee module

class dipper.sources.Bgee.Bgee(graph_type, are_bnodes_skolemized, tax_ids=None, version=None)

Bases: dipper.sources.Source.Source

Bgee is a database to retrieve and compare gene expression patterns between animal species.

Bgee first maps heterogeneous expression data (currently RNA-Seq, Affymetrix, in situ hybridization, and EST data) to anatomy and development of different species.

Then, to enable automated cross-species comparisons, it defines homology relationships between anatomies and comparison criteria between developmental stages.

BGEE_FTP = 'ftp.bgee.org'
DEFAULT_TAXA = [10090, 10116, 13616, 28377, 6239, 7227, 7955, 8364, 9031, 9258, 9544, 9593, 9597, 9598, 9606, 9823, 9913]

checkIfRemoteIsNewer(localfile, remote_size, remote_modify)

Overrides checkIfRemoteIsNewer in Source class

Parameters:
  • localfile – str, path to the local file
  • remote_size – str, size of the remote file in bytes
  • remote_modify – str, last-modified timestamp of the remote file in the form YYYYMMDDHHMMSS, e.g. 20160705042714
Returns: boolean, True if the remote file is newer, else False
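The comparison described above can be sketched as follows. This is a minimal illustration, not dipper's actual implementation: the function name remote_is_newer is hypothetical, and the sketch assumes that the timestamp is in UTC, that a missing local file counts as out of date, and that a size mismatch alone is enough to report the remote file as newer.

```python
import os
from datetime import datetime, timezone


def remote_is_newer(localfile, remote_size, remote_modify):
    """Illustrative stand-in for checkIfRemoteIsNewer (hypothetical helper)."""
    # Assumption: a missing local copy always counts as out of date.
    if not os.path.isfile(localfile):
        return True

    # Parse the compact timestamp, e.g. '20160705042714'.
    # Assumption: the server reports the time in UTC.
    remote_dt = datetime.strptime(remote_modify, '%Y%m%d%H%M%S').replace(
        tzinfo=timezone.utc)

    status = os.stat(localfile)
    local_dt = datetime.fromtimestamp(status.st_mtime, tz=timezone.utc)

    # remote_size arrives as a string, so convert it before comparing.
    size_differs = status.st_size != int(remote_size)
    return size_differs or remote_dt > local_dt
```

Both remote_size and remote_modify are passed as strings, matching the parameter types listed above, so the conversion to int and datetime happens inside the helper.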

fetch(is_dl_forced=False)

Parameters: is_dl_forced – boolean, force download
Returns:

files = {'anat_entity': {'path': '/download/ranks/anat_entity/', 'pattern': re.compile('.*_all_data_.*')}}
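To show how the 'pattern' entry of files might be applied to a directory listing, here is a small sketch. The helper select_rank_files and the sample listing are hypothetical; only the regular expression itself comes from the attribute above.

```python
import re

# Same expression as the 'pattern' entry in the files attribute.
PATTERN = re.compile('.*_all_data_.*')


def select_rank_files(names, pattern=PATTERN):
    """Keep only the names that match the pattern (hypothetical helper)."""
    return [name for name in names if pattern.match(name)]


# Hypothetical listing of the remote directory.
listing = [
    '9598_anat_entity_all_data_Pan_troglodytes.tsv.zip',
    'README.txt',
]

selected = select_rank_files(listing)
print(selected)
```

With this listing, only the name containing _all_data_ is kept; README.txt is filtered out.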
parse(limit=None)

Given the input taxa, expects a file in the raw directory for each taxon, named like {tax_id}_anat_entity_all_data_Pan_troglodytes.tsv.zip, where the species suffix (here Pan_troglodytes) varies with the taxon.

Parameters: limit – int, limit to top ranked anatomy associations per group
Returns: None
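The file naming convention that parse relies on can be illustrated with a short lookup sketch. The function files_for_taxa is hypothetical and is not part of dipper; it assumes the taxon ID is everything before the first underscore in the file name.

```python
from pathlib import Path


def files_for_taxa(rawdir, tax_ids):
    """Map each requested taxon ID to its file in rawdir (hypothetical helper)."""
    wanted = {str(tax_id) for tax_id in tax_ids}
    found = {}
    for path in sorted(Path(rawdir).glob('*_anat_entity_all_data_*.tsv.zip')):
        # Assumption: the taxon ID is the prefix before the first underscore.
        prefix = path.name.split('_', 1)[0]
        if prefix in wanted:
            found[int(prefix)] = path
    return found
```

Taxa with no matching file are simply absent from the returned dictionary, so a caller can compare its keys against the requested IDs to detect missing downloads.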