Gene Expression Omnibus


clinvar

Student Project done by:

  • Cheung Lung Chun, Joe
  • Tsoi Ka Wai, Raymond
  • Hu Yashi, Karen
  • Kwok Kam Chak, George

Their slides can be downloaded here

Gene Expression Omnibus (GEO) is a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.

The three main goals of GEO are to:

  • Provide a robust, versatile database in which to efficiently store high-throughput functional genomic data (see Data organization)
  • Offer simple submission procedures and formats that support complete and well-annotated data deposits from the research community (see Submission guide)
  • Provide user-friendly mechanisms that allow users to query, locate, review and download studies and gene expression profiles of interest (see Query and analysis)

Please see the GEO Documentation listings to find more information about various aspects of GEO.

clinvar Official Website

Query and Analysis

GEO data can be retrieved and analyzed in several ways:

  • To look at a particular GEO record for which you have the accession number, use the GEO accession box located on the GEO homepage or at the top of each GEO record.

  • To download data, see the various options described on the Download GEO data page.

  • To quickly locate data relevant to your interests, search GEO DataSets and GEO Profiles:

    GEO DataSets is a study-level database which users can search for studies relevant to their interests. The database stores descriptions of all original submitter-supplied records, as well as curated DataSets. More information about GEO DataSets and how to interpret GEO DataSets results pages can be found on the About GEO DataSets page.

    GEO Profiles is a gene-level database which users can search for gene expression profiles relevant to their interests. More information about GEO Profiles and how to interpret GEO Profiles results pages can be found on the About GEO Profiles page.

GEO DataSet and GEO Profiles searches may be effectively performed by simply entering appropriate keywords and phrases into the search box. However, given the large volumes of data stored in these databases, it is often useful to perform more refined queries in order to filter down to the most relevant data. Examples and full details about how to perform sophisticated queries are provided in the Querying GEO DataSets and GEO Profiles page. Additionally, the Limits and Advanced Search tools, linked at the head of the GEO DataSets and GEO Profiles pages, assist greatly in the construction of complex queries:

  1. GEO DataSets Limits Query Builder

  2. GEO DataSets Advanced Search

  3. GEO Profiles Limits Query Builder

  4. GEO Profiles Advanced Search

Once you have identified a DataSet of interest there are several features on the DataSet record that help identify interesting gene expression profiles within that study, including a t-test tool and clusters. Full information about these features is provided on the About GEO DataSets page.

Once you have identified gene expression profiles of interest there are several links on the Profile records that help identify additional genes of interest, including similarly expressed genes or genes within close proximity on the chromosome. Full information about these links is provided on the About GEO Profiles page.

GEO Architecture - Four Kinds of Data Records

  1. Platform (GPL) = the technology used and the features detected

  2. Sample (GSM)preparation and description of sample.

  3. Series (GSE)
    a set of sample and how they are related.

  4. Data (GDS)
    sample data collection assembled by GEO staff.


Top


Related