Annotations of Zebrafish Entrez Genes (EG) by domain superfamilies (SF).

Description

An R object that contains domain superfamily information for Zebrafish Entrez Genes. The data are derived from the SUPERFAMILY database, which provides SCOP domain architecture assignments for all completely sequenced genomes, including eukaryotic genomes. The domain architecture for an Entrez gene is taken from its longest protein product (by number of amino acids). Thus, the domain superfamily information for a Zebrafish Entrez gene is the list of domain superfamilies (excluding unknown gaps) appearing in that domain architecture.

Usage

org.Da.egSF <- dRDataLoader(RData = "org.Da.egSF")

Value

An object of class "GS", a list with the following components:

  • set_info: a matrix of nSet X 4 containing gene set information, where nSet is the number of gene sets (i.e. SCOP domain superfamilies). The 4 columns are "setID" (i.e. "SCOP domain identifier"), "name" (i.e. "SCOP domain description"), "namespace" (i.e. "SCOP domain level") and "distance" (i.e. "SCOP domain classification").

  • gs: a list of gene sets, each storing its gene members. Gene sets are always identified by "setID" and gene members by "Entrez ID".
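
The layout described above can be sketched with a small hand-built object. This is only an illustration: the set identifiers, names, column values, and Entrez IDs below are invented, and the real object returned by dRDataLoader may differ in details such as row names.

```r
# Toy stand-in for a "GS" object; every identifier here is hypothetical.
set_info <- matrix(
  c("sf1", "Toy superfamily A", "sf", "a.1.1",
    "sf2", "Toy superfamily B", "sf", "b.2.1"),
  ncol = 4, byrow = TRUE,
  dimnames = list(c("sf1", "sf2"),
                  c("setID", "name", "namespace", "distance"))
)
gs <- list(sf1 = c("1001", "1002", "1003"),
           sf2 = c("1002", "1004"))
toy <- structure(list(set_info = set_info, gs = gs), class = "GS")

# Number of gene sets (nSet) and the size of each set
n_set <- nrow(toy$set_info)
set_sizes <- vapply(toy$gs, length, integer(1))

# Description and gene members of one set, looked up by its setID
sf1_name <- toy$set_info["sf1", "name"]
sf1_genes <- toy$gs[["sf1"]]
```

The same accessors (`$set_info`, `$gs`) should apply to the loaded org.Da.egSF object, assuming it follows the structure documented here.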

References

Morais et al. (2011) SUPERFAMILY 1.75 including a domain-centric gene ontology method. Nucleic Acids Res, 39(Database issue):D427-434.

Andreeva et al. (2008) Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res, 36(Database issue):D419-425.

Examples

org.Da.egSF <- dRDataLoader(RData='org.Da.egSF')
# 'org.Da.egSF' (from https://github.com/hfang-bristol/RDataCentre/blob/master/dnet/1.0.7/org.Da.egSF.RData?raw=true) has been loaded into the working environment (at 2018-01-19 12:37:46)
names(org.Da.egSF)
# [1] "gs" "set_info"
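
Because `gs` is keyed by superfamily, a gene-centric view (the superfamilies found in each gene's domain architecture) requires inverting the list. The helper below is a minimal sketch, not part of dnet: `invert_gs` is a hypothetical name, and the toy IDs are invented so that the snippet runs without downloading the data.

```r
# Invert a setID -> genes list into a gene -> setIDs list.
# `invert_gs` is a hypothetical helper, not a dnet function.
invert_gs <- function(gs) {
  set_ids <- rep(names(gs), lengths(gs))    # one setID per gene membership
  genes <- unlist(gs, use.names = FALSE)    # gene members, in the same order
  split(set_ids, genes)                     # group setIDs by gene
}

# Toy input with invented identifiers
toy_gs <- list(sf1 = c("1001", "1002"), sf2 = c("1002", "1004"))
gene2sf <- invert_gs(toy_gs)

# Superfamilies assigned to the toy gene "1002"
gene2sf[["1002"]]
```

Assuming the loaded object has the documented structure, `invert_gs(org.Da.egSF$gs)` would give the same kind of mapping for real Entrez IDs.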