|
|
|
||||
| Home Help Feedback Subscriptions Archive Search Table of Contents | |||||
First published online March 19, 2008
doi: 10.1242/10.1242/jcs.022400
Cell Science at a Glance |
National Human Genome Research Institute, National Institutes of Health, 49 Convent Drive, Bethesda, MD 20892, USA
* Author for correspondence (e-mail: jsegre{at}mail.nih.gov)
| Introduction |
|---|
|
|
|---|
|
| Viewing a gene and its genomic region |
|---|
|
|
|---|
The Ensembl website provides a more traditional, gene-centric view. To view the mouse Gata3 gene, select Mus musculus as the species, type `gata-3' or `gata3' as the search term. Then, on the returned Ensembl Mouse SearchView page, select the `Ensembl protein_coding Gene' for Gata-3. The gene report page displays the gene's chromosomal position. An arrow (within the `Features' image) indicates the transcript orientation. Additional features include alignments with other species, orthologue and paralogue predictions, and links to other genome-based web resources.
| Viewing a transcript sequence |
|---|
|
|
|---|
To view the transcript sequence for Gata-3 from the Ensembl Gene Report, click on `Transcript information' (listed on the left) on the Ensembl Gene Report page. One can then opt to view the exons and codons by choosing this option from the drop-down list `Show the following features' at the bottom of the Ensembl Transcript Report page. Exons are distinguished by the alternating blue and black text and UTRs are displayed against a dark yellow background. To view the complete intronic sequences select `Exon information' from the menu on the lefthand side and tick the `show full intronic sequence' option on the Ensembl Exon Report page.
| Designing primers and ensuring specificity |
|---|
|
|
|---|
To ensure sequence specificity, the BLAT feature of the UCSC site will tell you whether your primers align with your gene of interest and/or elsewhere in the genome (Kent, 2002
). Select BLAT from any of the UCSC web pages and paste and submit your primers. In a returned BLAT search, the presence of multiple matches indicates a lack of specificity. Also displayed are the genomic coordinates of the primers, which can be very useful to organize your reagents. However, a `not found' BLAT Search result does not necessarily mean that your primer does not match because BLAT has limited specificity for sequences
40 bp. Another way of double-checking the specificity and correct orientation of primers is by using UCSC's virtual, In-Silico PCR tool. Select `PCR' from any of the UCSC web pages, and paste in the designated primers (forward and reverse) and define the maximum product size to generate the coordinates and amplicon genomic sequence.
| Predicting the transcription start site |
|---|
|
|
|---|
| Finding conserved noncoding regulatory sequences |
|---|
|
|
|---|
Conserved sequences can be further interrogated by computational programs such as TRANSFAC (http://www.gene-regulation.com/pub/databases.html) (Wingender et al., 2000
) and rVISTA (http://genome.lbl.gov/vista/rvista/submit.shtml) (Loots et al., 2002
) to identify specific transcription factor-binding-sites. MEME is a complementary approach that identifies common motifs in a set of queried sequences (http://meme.sdsc.edu/meme/intro.html) (Bailey and Elkan, 1994
). Ultimately, these genomic approaches may narrow the regions to test but only provide hypothetical regulatory sequences for further laboratory analysis.
| Annotating a genomic region(s) |
|---|
|
|
|---|
| Concluding remarks |
|---|
|
|
|---|
The public genome initiative ENCODE (for ENCyclopedia Of DNA Elements) seeks to identify the function of every sequence in the genome. Using multiple, diverse experiments to ascertain the function of a select 1% of the genome, the ENCODE pilot phase identified more pervasive transcription than previously predicted, a specific relationship between transcription start sites, chromatin structure and histone modification, correlation of early and late replication with regions of gene activation and repression, respectively, as well as new insights into the human genomic landscape based on comparisons between inter- and intraspecies genome analysis (ENCODE Consortium, 2007
).
As cell biologists, we test hypotheses to decipher cellular functions and mechanisms, many of which originate with specific gene expression. Cell-type-specific features, such as transcription factor occupancy and histone modifications to predict active promoters, enhancers and silencers are now being surveyed on a genome-wide scale with next-generation sequencing technology and integrated onto genome web browsers. Mapping the genomic landscapes of specific cell types should provide further predictive insights into cellular functions and mechanisms.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Bailey, T. L. and Elkan, C. (1994). Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28-36.[Medline]
Blanchette, M., Kent, W. J., Riemer, C., Elnitski, L., Smit, A. F., Roskin, K. M., Baertsch, R., Rosenbloom, K., Clawson, H., Green, E. D. et al. (2004). Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 14, 708-715.
ENCODE Consortium (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799-816.[CrossRef][Medline]
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M. and Dubchak, I. (2004). VISTA: computational tools for comparative genomics. Nucleic Acids Res. 32, W273-W279.
Hardison, R., Slightom, J. L., Gumucio, D. L., Goodman, M., Stojanovic, N. and Miller, W. (1997). Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene 205, 73-94.[CrossRef][Medline]
Hubbard, T. J., Aken, B. L., Beal, K., Ballester, B., Caccamo, M., Chen, Y., Clarke, L., Coates, G., Cunningham, F., Cutts, T. et al. (2007). Ensembl 2007. Nucleic Acids Res. 35, D610-D617.[CrossRef][Medline]
Kent, W. J. (2002). BLAT – the BLAST-like alignment tool. Genome Res. 12, 656-664.
Kent, W. J., Sugnet, C. W., Furey, T. S., Roskin, K. M., Pringle, T. H., Zahler, A. M. and Haussler, D. (2002). The human genome browser at UCSC. Genome Res. 12, 996-1006.
Krek, A., Grun, D., Poy, M. N., Wolf, R., Rosenberg, L., Epstein, E. J., MacMenamin, P., da Piedade, I., Gunsalus, K. C., Stoffel, M. et al. (2005). Combinatorial microRNA target predictions. Nat. Genet. 37, 495-500.[CrossRef][Medline]
Lewis, B. P., Burge, C. B. and Bartel, D. P. (2005). Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120, 15-20.[CrossRef][Medline]
Loots, G. G., Ovcharenko, I., Pachter, L., Dubchak, I. and Rubin, E. M. (2002). rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res. 12, 832-839.
Rozen, S. and Skaletsky, H. (2000). Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365-386.[Medline]
Schwartz, S., Elnitski, L., Li, M., Weirauch, M., Riemer, C., Smit, A., Green, E. D., Hardison, R. C. and Miller, W. (2003). MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res. 31, 3518-3524.
Siepel, A., Bejerano, G., Pedersen, J. S., Hinrichs, A. S., Hou, M., Rosenbloom, K., Clawson, H., Spieth, J., Hillier, L. W., Richards, S. et al. (2005). Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034-1050.
Wingender, E., Chen, X., Hehl, R., Karas, H., Liebich, I., Matys, V., Meinhardt, T., Pruss, M., Reuter, I. and Schacherer, F. (2000). TRANSFAC: an integrated system for gene expression regulation. Nucleic Acids Res. 28, 316-319.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||