Academia.eduAcademia.edu

Outline

Bolbase: a comprehensive genomics database for Brassica oleracea

2013, BMC Genomics

Abstract

Background Brassica oleracea is a morphologically diverse species in the family Brassicaceae and contains a group of nutrition-rich vegetable crops, including common heading cabbage, cauliflower, broccoli, kohlrabi, kale, Brussels sprouts. This diversity along with its phylogenetic membership in a group of three diploid and three tetraploid species, and the recent availability of genome sequences within Brassica provide an unprecedented opportunity to study intra- and inter-species divergence and evolution in this species and its close relatives. Description We have developed a comprehensive database, Bolbase, which provides access to the B. oleracea genome data and comparative genomics information. The whole genome of B. oleracea is available, including nine fully assembled chromosomes and 1,848 scaffolds, with 45,758 predicted genes, 13,382 transposable elements, and 3,581 non-coding RNAs. Comparative genomics information is available, including syntenic regions among B. oleracea,...

References (21)

  1. U N: Genome analysis in brassica with special reference to the experimental formation of B. Napus and peculiar mode of fertilization. Japan J Bot 1935, 7:389-452.
  2. Kalloo G, Bergh BO: Genetic improvement of vegetable crops. Oxford: Pergamon; 1993.
  3. Wang XTM, Pierce G, Lemke C, Nelson LK, Yuksel B, Bowers JE, Marler B, Xiao Y, Lin L, Epps E, Sarazen H, Rogers C, Karunakaran S, Ingles J, Giattina E, Mun JH, Seol YJ, Park BS, Amasino RM, Quiros CF, Osborn TC, Pires JC, Town C, Paterson AH: A physical map of Brassica oleracea shows complexity of chromosomal changes following recursive paleopolyploidizations. BMC Genomics 2011, 12:470-485.
  4. Ayele MHB, Kumar N, Wu H, Xiao Y, Aken SV, Utterback TR, Wortman JR, White OR, Town CD: Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis. Genome Res 2005, 15:487-495.
  5. Bowers JE, Chapman BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 2003, 422(6930):433-438.
  6. Yau-Wen Yang K-NL, Pon-Yean T, Wen-Hsiung L: Rates of nucleotide substitution in angiosperm mitochondrial DNA sequences and dates of divergence between brassica and other angiosperm lineages. J Mol Evol 1999, 48:597-604.
  7. Tae-Jin Yang JSK, Soo-Jin K, Ki-Byung L, Beom-Soon C, Jin-A K, Mina J, et al: Sequence-level analysis of the diploidization process in the triplicated FLOWERING LOCUS C region of brassica Rapa. Plant Cell 2006, 18:1339-1347.
  8. Inaba RNT: Phylogenetic analysis of brassiceae based on the nucleotide sequences of the S-locus related gene, SLR1. Theor Appl Genet 2002, 105(8):1159-1165.
  9. Cheng F, Liu S, Wu J, Fang L, Sun S, Liu B, Li P, Hua W, Wang X: BRAD, the genetics and genomics database for brassica plants. BMC Plant Biol 2011, 11:136.
  10. Huala E, Dickerman AW, Garcia-Hernandez M, Weems D, Reiser L, LaFond F, Hanley D, Kiphart D, Zhuang M, Huang W, et al: The Arabidopsis information resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res 2001, 29(1):102-105.
  11. Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Res 2011, 39(Web Server issue):W29-37.
  12. Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, et al: The Pfam protein families database. Nucleic Acids Res 2012, 40(Database issue):D290-301.
  13. Li L, Stoeckert CJ Jr, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res 2003, 13(9):2178-2189.
  14. Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee TH, Jin H, Marler B, Guo H, et al: MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res 2012, 40(7):e49.
  15. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25(17):3389-3402.
  16. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R: InterProScan: protein domains identifier. Nucleic Acids Res 2005, 33(Web Server issue):W116-120.
  17. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet 2000, 25(1):25-29.
  18. O'Donovan C, Martin MJ, Gattiker A, Gasteiger E, Bairoch A, Apweiler R: High-quality protein knowledge resource: SWISS-PROT and TrEMBL. Brief Bioinform 2002, 3(3):275-284.
  19. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, et al: KEGG for linking genomes to life and the environment. Nucleic Acids Res 2008, 36(Database issue):D480-484.
  20. Donlin MJ: Using the generic genome browser (GBrowse). Curr Protoc Bioinformatics 2009, 28:9.9.1-9.9.25.
  21. McKay SJ, Vergara IA, Stajich JE: Using the generic synteny browser (GBrowse_syn). Curr Protoc Bioinformatics 2010, 31:9.12.1-9.12.25. doi:10.1186/1471-2164-14-664 Cite this article as: Yu et al.: Bolbase: a comprehensive genomics database for Brassica oleracea. BMC Genomics 2013 14:664.