172 Matching Results

Search Results

Advanced search parameters have been applied.

Bacterial Microcompartments

Description: Bacterialmicrocompartments (BMCs) are organelles composed entirely of protein. They promote specific metabolic processes by encapsulatingand colocalizing enzymes with their substrates and cofactors, by protecting vulnerable enzymes in a defined microenvironment, and bysequestering toxic or volatile intermediates. Prototypes of the BMCsare the carboxysomes of autotrophic bacteria. However, structures of similarpolyhedral shape are being discovered in an ever-increasing number of heterotrophic bacteria, where they participate in the utilization ofspecialty carbon and energy sources.Comparative genomics reveals that the potential for this type of compartmentalization is widespread acrossbacterial phyla and suggests that genetic modules encoding BMCs are frequently laterally transferred among bacteria. The diverse functionsof these BMCs suggest that they contribute to metabolic innovation in bacteria in a broad range of environments.
Date: June 5, 2010
Creator: Kerfeld, Cheryl A.; Heinhorst, Sabine & Cannon, Gordon C.
Partner: UNT Libraries Government Documents Department

Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

Description: Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.
Date: March 23, 2010
Creator: Consortium, Catfish Genome; Wang, Shaolin; Peatman, Eric; Abernathy, Jason; Waldbieser, Geoff; Lindquist, Erika et al.
Partner: UNT Libraries Government Documents Department

Bacillus anthracis genome organization in light of whole transcriptome sequencing

Description: Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computational predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.
Date: March 22, 2010
Creator: Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.; Bergman, Nicholas & Borodovsky, Mark
Partner: UNT Libraries Government Documents Department

Complete genome sequence of Denitrovibrio acetiphilus type strain (N2460T)

Description: Denitrovibrio acetiphilus Myhr and Torsvik 2000 is the type species of the genus Denitrovibrio in the bacterial family Deferribacteraceae. It is of phylogenetic interest because there are only six genera described in the family Deferribacteraceae. D. acetiphilus was isolated as a representative of a population reducing nitrate to ammonia in a laboratory column simulating the conditions in off-shore oil recovery fields. When nitrate was added to this column undesirable hydrogen sulfide production was stopped because the sulfate reducing populations were superseded by these nitrate reducing bacteria. Here we describe the features of this marine, mesophilic, obligately anaerobic organism respiring by nitrate reduction, together with the complete genome sequence, and annotation. This is the second complete genome sequence of the order Deferribacterales and the class Deferribacteres, which is the sole class in the phylum Deferribacteres. The 3,222,077 bp genome with its 3,034 protein-coding and 51 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.
Date: June 25, 2010
Creator: Kiss, Hajnalka; Lang, Elke; Lapidus, Alla; Copeland, Alex; Nolan, Matt; Glavina Del Rio, Tijana et al.
Partner: UNT Libraries Government Documents Department

ChIP-seq Identification of Weakly Conserved Heart Enhancers

Description: Accurate control of tissue-specific gene expression plays a pivotal role in heart development, but few cardiac transcriptional enhancers have thus far been identified. Extreme non-coding sequence conservation successfully predicts enhancers active in many tissues, but fails to identify substantial numbers of heart enhancers. Here we used ChIP-seq with the enhancer-associated protein p300 from mouse embryonic day 11.5 heart tissue to identify over three thousand candidate heart enhancers genome-wide. Compared to other tissues studied at this time-point, most candidate heart enhancers are less deeply conserved in vertebrate evolution. Nevertheless, the testing of 130 candidate regions in a transgenic mouse assay revealed that most of them reproducibly function as enhancers active in the heart, irrespective of their degree of evolutionary constraint. These results provide evidence for a large population of poorly conserved heart enhancers and suggest that the evolutionary constraint of embryonic enhancers can vary depending on tissue type.
Date: July 1, 2010
Creator: Blow, Matthew J.; McCulley, David J.; Li, Zirong; Zhang, Tao; Akiyama, Jennifer A.; Holt, Amy et al.
Partner: UNT Libraries Government Documents Department

The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

Description: Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences are important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~;;30K unique sequences (UniSeqs) representing ~;;19K clusters were generated from ~;;98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66percent of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases.Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.
Date: January 27, 2010
Creator: Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika; Tanaka, Yoshihiro; Teranishi, Kristen S.; Sunagawa, Shinichi et al.
Partner: UNT Libraries Government Documents Department

The Complete Multipartite Genome Sequence of Cupriavidus necator JMP134, a Versatile Pollutant Degrader

Description: Cupriavidus necator JMP134 (formerly Ralstonia eutropha JMP134) is a Gram-negative {beta}-proteobacterium able to degrade a variety of chloroaromatic compounds and chemically-related pollutants. It was originally isolated based on its ability to use 2,4 dichlorophenoxyacetic acid (2,4-D) as a sole carbon and energy source [1]. In addition to 2,4-D, this strain can also grow on a variety of aromatic substrates, such as 4-chloro-2-methylphenoxyacetate (MCPA), 3-chlorobenzoic acid (3-CB) [2], 2,4,6-trichlorophenol [3], and 4-fluorobenzoate [4]. The genes necessary for 2,4-D utilization have been identified. They are located in two clusters on plasmid pPJ4: tfd{sub I} and tfd{sub II} [5,6,7,8]. The sequence and analysis of plasmid pJP4 was reported and a congruent model for bacterial adaptation to chloroaromatic pollutants was proposed [9]. According to this model, catabolic gene clusters assemble in a modular manner into broad-host-range plasmid backbones by means of repeated chromosomal capture events. Cupriavidus and related Burkholderia genomes are typically multipartite, composed of two large replicons (chromosomes) accompanied by classical plasmids. Previous work with Burkholderia xenovorans LB400 revealed a differential gene distribution with core functions preferentially encoded by the larger chromosome and secondary functions by the smaller [10]. It has been proposed that the secondary chromosomes in many bacteria originated from ancestral plasmids which, in turn, had been the recipient of genes transferred earlier from ancestral primary chromosomes [11]. The existence of multiple Cupriavidus and Burkholderia genomes provides the opportunity for comparative studies that will lead to a better understanding of the evolutionary mechanisms for the formation of multipartite genomes and the relation with biodegradation abilities.
Date: February 1, 2010
Creator: Lykidis, Athanasios; Perez-Pantoja, Danilo; Ledger, Thomas; Mavromatis, Kostantinos; Anderson, Iain J.; Ivanova, Natalia N. et al.
Partner: UNT Libraries Government Documents Department

Targeted deletion of the 9p21 noncoding coronary artery disease risk interval in mice

Description: Sequence polymorphisms in a 58kb interval on chromosome 9p21 confer a markedly increased risk for coronary artery disease (CAD), the leading cause of death worldwide 1,2. The variants have a substantial impact on the epidemiology of CAD and other life?threatening vascular conditions since nearly a quarter of Caucasians are homozygous for risk alleles. However, the risk interval is devoid of protein?coding genes and the mechanism linking the region to CAD risk has remained enigmatic. Here we show that deletion of the orthologous 70kb noncoding interval on mouse chromosome 4 affects cardiac expression of neighboring genes, as well as proliferation properties of vascular cells. Chr4delta70kb/delta70kb mice are viable, but show increased mortality both during development and as adults. Cardiac expression of two genes near the noncoding interval, Cdkn2a and Cdkn2b, is severely reduced in chr4delta70kb/delta70kb mice, indicating that distant-acting gene regulatory functions are located in the noncoding CAD risk interval. Allelespecific expression of Cdkn2b transcripts in heterozygous mice revealed that the deletion affects expression through a cis-acting mechanism. Primary cultures of chr4delta70kb/delta70kb aortic smooth muscle cells exhibited excessive proliferation and diminished senescence, a cellular phenotype consistent with accelerated CAD pathogenesis. Taken together, our results provide direct evidence that the CAD risk interval plays a pivotal role in regulation of cardiac Cdkn2a/b expression and suggest that this region affects CAD progression by altering the dynamics of vascular cell proliferation.
Date: January 1, 2010
Creator: Visel, Axel; Zhu, Yiwen; May, Dalit; Afzal, Veena; Gong, Elaine; Attanasio, Catia et al.
Partner: UNT Libraries Government Documents Department

Experimental factors affecting PCR-based estimates of microbial species richness and evenness

Description: Pyrosequencing of 16S rRNA gene amplicons for microbial community profiling can, for equivalent costs, yield greater than two orders of magnitude more sensitivity than traditional PCR-cloning and Sanger sequencing. With this increased sensitivity and the ability to analyze multiple samples in parallel, it has become possible to evaluate several technical aspects of PCRbased community structure profiling methods. We tested the effect of amplicon length and primer pair on estimates of species richness number of species and evenness relative abundance of species by assessing the potentially tractable microbial community residing in the termite hindgut. Two regions of the 16S rRNA gene were sequenced from one of two common priming sites, spanning the V1-V2 or V8 regions, using amplicons ranging n length from 352 to 1443 bp. Our results demonstrate that both amplicon length and primer pair markedly influence estimates of richness and evenness. However, estimates of species evenness are consistent among different primer pairs targeting the same region. These results highlight the importance of experimental methodology when comparing diversity estimates across communities.
Date: December 1, 2009
Creator: Engelbrektson, Anna; Kunin, Victor; Wrighton, Kelly C.; Zvenigorodsky, Natasha; Chen, Feng; Ochman, Howard et al.
Partner: UNT Libraries Government Documents Department

Gene context analysis in the Integrated Microbial Genomes (IMG) data management system

Description: Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes are often having similar gene context and relies on the identification of such events across a statistically significant and phylogeneticaly diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate and explore gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.
Date: May 1, 2009
Creator: Mavromatis, Konstantinos; Chu, Ken; Ivanova, Natalia; Hooper, Sean D.; Markowitz, Victor M. & Kyrpides, Nikos C.
Partner: UNT Libraries Government Documents Department

Assembling the Marine Metagenome, One Cell at a Time

Description: The difficulty associated with the cultivation of most microorganisms and the complexity of natural microbial assemblages, such as marine plankton or human microbiome, hinder genome reconstruction of representative taxa using cultivation or metagenomic approaches. Here we used an alternative, single cell sequencing approach to obtain high-quality genome assemblies of two uncultured, numerically significant marine microorganisms. We employed fluorescence-activated cell sorting and multiple displacement amplification to obtain hundreds of micrograms of genomic DNA from individual, uncultured cells of two marine flavobacteria from the Gulf of Maine that were phylogenetically distant from existing cultured strains. Shotgun sequencing and genome finishing yielded 1.9 Mbp in 17 contigs and 1.5 Mbp in 21 contigs for the two flavobacteria, with estimated genome recoveries of about 91percent and 78percent, respectively. Only 0.24percent of the assembling sequences were contaminants and were removed from further analysis using rigorous quality control. In contrast to all cultured strains of marine flavobacteria, the two single cell genomes were excellent Global Ocean Sampling (GOS) metagenome fragment recruiters, demonstrating their numerical significance in the ocean. The geographic distribution of GOS recruits along the Northwest Atlantic coast coincided with ocean surface currents. Metabolic reconstruction indicated diverse potential energy sources, including biopolymer degradation, proteorhodopsin photometabolism, and hydrogen oxidation. Compared to cultured relatives, the two uncultured flavobacteria have small genome sizes, few non-coding nucleotides, and few paralogous genes, suggesting adaptations to narrow ecological niches. These features may have contributed to the abundance of the two taxa in specific regions of the ocean, and may have hindered their cultivation. We demonstrate the power of single cell DNA sequencing to generate reference genomes of uncultured taxa from a complex microbial community of marine bacterioplankton. A combination of single cell genomics and metagenomics enabled us to analyze the genome content, metabolic adaptations, and biogeography of these taxa.
Date: June 24, 2010
Creator: Woyke, Tanja; Xie, Gary; Copeland, Alex; Gonzalez, Jose M.; Han, Cliff; Kiss, Hajnalka et al.
Partner: UNT Libraries Government Documents Department

Gap Closing/Finishing by Targeted Genomic Region Enrichment and Sequencing

Description: Gap Closing/Finishing of draft genome assemblies is a labor and cost intensive process where several rounds of repetitious amplification and sequencing are required. Here we demonstrate a high throughput procedure where custom primers flanking gaps in draft genomes are designed. Primer libraries containing up to 4,000 unique pairs in independent droplets are merged with a fragmented genomic template. From this millions of picoliter scale droplets are formed, each one being the functional equivalent of an individual PCR reaction. The PCR products are concatenated and sequenced by Illumina which is then assembled and used for gap closure. Here we present an overall experimental strategy, primer design algorithm and initial results.
Date: May 27, 2010
Creator: Singh, Kanwar; Froula, Jeff; Trice, Hope; Pennacchio, Len A. & Chen, Feng
Partner: UNT Libraries Government Documents Department

Analysis of Illumina Microbial Assemblies

Description: Since the emerging of second generation sequencing technologies, the evaluation of different sequencing approaches and their assembly strategies for different types of genomes has become an important undertaken. Next generation sequencing technologies dramatically increase sequence throughput while decreasing cost, making them an attractive tool for whole genome shotgun sequencing. To compare different approaches for de-novo whole genome assembly, appropriate tools and a solid understanding of both quantity and quality of the underlying sequence data are crucial. Here, we performed an in-depth analysis of short-read Illumina sequence assembly strategies for bacterial and archaeal genomes. Different types of Illumina libraries as well as different trim parameters and assemblers were evaluated. Results of the comparative analysis and sequencing platforms will be presented. The goal of this analysis is to develop a cost-effective approach for the increased throughput of the generation of high quality microbial genomes.
Date: May 28, 2010
Creator: Clum, Alicia; Foster, Brian; Froula, Jeff; LaButti, Kurt; Sczyrba, Alex; Lapidus, Alla et al.
Partner: UNT Libraries Government Documents Department

Complete genome sequence of Coraliomargarita akajimensis type strain (04OKA010-24T)

Description: Coraliomargarita akajimensis Yoon et al. 2007 the type species of the genus Coraliomargarita. C. akajimensis is an obligately aerobic, Gram-negative, non-spore-forming, non-motile, spherical bacterium which was isolated from seawater surrounding the hard coral Galaxea fascicularis. C. akajimensis organism is of special interest because of its phylogenetic position in a genomically purely studied area in the bacterial diversity. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of a member of the family Puniceicoccaceae. The 3,750,771 bp long genome with its 3,137 protein-coding and 55 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
Date: June 25, 2010
Creator: Mavromatis, Konstantinos; Abt, Birte; Brambilla, Evelyne; Lapidus, Alla; Copeland, Alex; Desphande, Shweta et al.
Partner: UNT Libraries Government Documents Department

Structural Determinats Underlying Photoprotection in the Photoactive Orange Carotenoid Protein of Cyanobacteria

Description: The photoprotective processes of photosynthetic organisms involve the dissipation of excess absorbed light energy as heat. Photoprotection in cyanobacteria is mechanistically distinct from that in plants; it involves the Orange Carotenoid Protein (OCP), a water-soluble protein containing a single carotenoid. The OCP is a new member of the family of blue light photoactive proteins; blue-green light triggers the OCP-mediated photoprotective response. Here we report structural and functional characterization of the wildtype and two mutant forms of the OCP, from the model organism Synechocystis PCC6803. The structural analysis provides highresolution detail of the carotenoidprotein interactions that underlie the optical properties of the OCP, unique among carotenoid-proteins in binding a single pigment per polypeptide chain. Collectively, these data implicate several key amino acids in the function of the OCP and reveal that the photoconversion and photoprotective responses of the OCP to blue-green light can be decoupled.
Date: April 1, 2010
Creator: Wilson, Adjele; Kinney, James N.; Zwart, Petrus H.; Punginelli, Claire; D'Haene, Sandrine; Perreau, Francois et al.
Partner: UNT Libraries Government Documents Department

Transcriptomic response of the mycoparasitic fungus Trichoderma atroviride to the presence of a fungal prey

Description: BACKGROUND: Combating the action of plant pathogenic microorganisms by mycoparasitic fungi has been announced as an attractive biological alternative to the use of chemical fungicides since two decades. The fungal genus Trichoderma includes a high number of taxa which are able to recognize, combat and finally besiege and kill their prey. Only fragments of the biochemical processes related to this ability have been uncovered so far, however. RESULTS: We analyzed genome-wide gene expression changes during the begin of physical contact between Trichoderma atroviride and two plant pathogens Botrytis cinerea and Rhizoctonia solani, and compared with gene expression patterns of mycelial and conidiating cultures, respectively. About 3000 ESTs, representing about 900 genes, were obtained from each of these three growth conditions. 66 genes, represented by 442 ESTs, were specifically and significantly overexpressed during onset of mycoparasitism, and the expression of a subset thereof was verified by expression analysis. The upregulated genes comprised 18 KOG groups, but were most abundant from the groups representing posttranslational processing, and amino acid metabolism, and included components of the stress response, reaction to nitrogen shortage, signal transduction and lipid catabolism. Metabolic network analysis confirmed the upregulation of the genes for amino acid biosynthesis and of those involved in the catabolism of lipids and aminosugars. CONCLUSION: The analysis of the genes overexpressed during the onset of mycoparasitism in T. atroviride has revealed that the fungus reacts to this condition with several previously undetected physiological reactions. These data enable a new and more comprehensive interpretation of the physiology of mycoparasitism, and will aid in the selection of traits for improvement of biocontrol strains by recombinant techniques.
Date: July 23, 2010
Creator: Seidl, Verena; Song, Lifu; Lindquist, Erika; Gruber, Sabine; Koptchinskiy, Alexeji; Zeilinger, Susanne et al.
Partner: UNT Libraries Government Documents Department

Estimating DNA coverage and abundance in metagenomes using a gamma approximation

Description: Shotgun sequencing generates large numbers of short DNA reads from either an isolated organism or, in the case of metagenomics projects, from the aggregate genome of a microbial community. These reads are then assembled based on overlapping sequences into larger, contiguous sequences (contigs). The feasibility of assembly and the coverage achieved (reads per nucleotide or distinct sequence of nucleotides) depend on several factors: the number of reads sequenced, the read length and the relative abundances of their source genomes in the microbial community. A low coverage suggests that most of the genomic DNA in the sample has not been sequenced, but it is often difficult to estimate either the extent of the uncaptured diversity or the amount of additional sequencing that would be most efficacious. In this work, we regard a metagenome as a population of DNA fragments (bins), each of which may be covered by one or more reads. We employ a gamma distribution to model this bin population due to its flexibility and ease of use. When a gamma approximation can be found that adequately fits the data, we may estimate the number of bins that were not sequenced and that could potentially be revealed by additional sequencing. We evaluated the performance of this model using simulated metagenomes and demonstrate its applicability on three recent metagenomic datasets.
Date: January 1, 2010
Creator: Hooper, Sean D; Dalevi, Daniel; Pati, Amrita; Mavromatis, Konstantinos; Ivanova, Natalia N & Kyrpides, Nikos C
Partner: UNT Libraries Government Documents Department

Development of High Throughput Process for Constructing 454 Titanium and Illumina Libraries

Description: We have developed two processes with the Biomek FX robot to construct 454 titanium and Illumina libraries in order to meet the increasing library demands. All modifications in the library construction steps were made to enable the adaptation of the entire processes to work with the 96-well plate format. The key modifications include the shearing of DNA with Covaris E210 and the enzymatic reaction cleaning and fragment size selection with SPRI beads and magnetic plate holders. The construction of 96 Titanium libraries takes about 8 hours from sheared DNA to ssDNA recovery. The processing of 96 Illumina libraries takes less time than that of the Titanium library process. Although both processes still require manual transfer of plates from robot to other work stations such as thermocyclers, these robotic processes represent about 12- to 24-folds increase of library capacity comparing to the manual processes. To enable the sequencing of many libraries in parallel, we have also developed sets of molecular barcodes for both library types. The requirements for the 454 library barcodes include 10 bases, 40-60percent GC, no consecutive same base, and no less than 3 bases difference between barcodes. We have used 96 of the resulted 270 barcodes to construct libraries and pool to test the ability of accurately assigning reads to the right samples. When allowing 1 base error occurred in the 10 base barcodes, we could assign 99.6percent of the total reads and 100percent of them were uniquely assigned. As for the Illumina barcodes, the requirements include 4 bases, balanced GC, and at least 2 bases difference between barcodes. We have begun to assess the ability to assign reads after pooling different number of libraries. We will discuss the progress and the challenges of these scale-up processes.
Date: May 28, 2010
Creator: Deshpande, Shweta; Hack, Christopher; Tang, Eric; Malfatti, Stephanie; Ewing, Aren; Lucas, Susan et al.
Partner: UNT Libraries Government Documents Department

Illumina Production Sequencing at the DOE Joint Genome Institute - Workflow and Optimizations

Description: The U.S. Department of Energy (DOE) Joint Genome Institute?s (JGI) Production Sequencing group is committed to the generation of high-quality genomic DNA sequence to support the DOE mission areas of renewable energy generation, global carbon management, and environmental characterization and clean-up. Within the JGI?s Production Sequencing group, the Illumina Genome Analyzer pipeline has been established as one of three sequencing platforms, along with Roche/454 and ABI/Sanger. Optimization of the Illumina pipeline has been ongoing with the aim of continual process improvement of the laboratory workflow. These process improvement projects are being led by the JGI?s Process Optimization, Sequencing Technologies, Instrumentation& Engineering, and the New Technology Production groups. Primary focus has been on improving the procedural ergonomics and the technicians? operating environment, reducing manually intensive technician operations with different tools, reducing associated production costs, and improving the overall process and generated sequence quality. The U.S. DOE JGI was established in 1997 in Walnut Creek, CA, to unite the expertise and resources of five national laboratories? Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Pacific Northwest ? along with HudsonAlpha Institute for Biotechnology. JGI is operated by the University of California for the U.S. DOE.
Date: June 18, 2010
Creator: Tarver, Angela; Fern, Alison; Diego, Matthew San; Kennedy, Megan; Zane, Matthew; Daum, Christopher et al.
Partner: UNT Libraries Government Documents Department

The Genome of Naegleria gruberi Illuminates Early Eukaryotic Versatility

Description: Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likely present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.
Date: March 1, 2010
Creator: Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.; Dacks, Joel; Carpenter, Meredith L.; Field, Mark C. et al.
Partner: UNT Libraries Government Documents Department

A multi-channel gel electrophoresis and continuous fraction collection apparatus for high throughput protein separation and characterization

Description: To facilitate a direct interface between protein separation by PAGE and protein identification by mass spectrometry, we developed a multichannel system that continuously collects fractions as protein bands migrate off the bottom of gel electrophoresis columns. The device was constructed using several short linear gel columns, each of a different percent acrylamide, to achieve a separation power similar to that of a long gradient gel. A Counter Free-Flow elution technique then allows continuous and simultaneous fraction collection from multiple channels at low cost. We demonstrate that rapid, high-resolution separation of a complex protein mixture can be achieved on this system using SDS-PAGE. In a 2.5 h electrophoresis run, for example, each sample was separated and eluted into 48-96 fractions over a mass range of 10-150 kDa; sample recovery rates were 50percent or higher; each channel was loaded with up to 0.3 mg of protein in 0.4 mL; and a purified band was eluted in two to three fractions (200 L/fraction). Similar results were obtained when running native gel electrophoresis, but protein aggregation limited the loading capacity to about 50 g per channel and reduced resolution.
Date: October 2, 2009
Creator: Choi, Megan; Nordmeyer, Robert A.; Cornell, Earl; Dong, Ming; Biggin, Mark D. & Jin, Jian
Partner: UNT Libraries Government Documents Department

Metagenomic Insights into Evolution of a Heavy Metal-Contaminated Groundwater Microbial Community

Description: Understanding adaptation of biological communities to environmental change is a central issue in ecology and evolution. Metagenomic analysis of a stressed groundwater microbial community reveals that prolonged exposure to high concentrations of heavy metals, nitric acid and organic solvents (~;;50 years) have resulted in a massive decrease in species and allelic diversity as well as a significant loss of metabolic diversity. Although the surviving microbial community possesses all metabolic pathways necessary for survival and growth in such an extreme environment, its structure is very simple, primarily composed of clonal denitrifying ?- and ?-proteobacterial populations. The resulting community is over-abundant in key genes conferring resistance to specific stresses including nitrate, heavy metals and acetone. Evolutionary analysis indicates that lateral gene transfer could be a key mechanism in rapidly responding and adapting to environmental contamination. The results presented in this study have important implications in understanding, assessing and predicting the impacts of human-induced activities on microbial communities ranging from human health to agriculture to environmental management, and their responses to environmental changes.
Date: February 15, 2010
Creator: Hemme, Christopher L.; Deng, Ye; Gentry, Terry J.; Fields, Matthew W.; Wu, Liyou; Barua, Soumitra et al.
Partner: UNT Libraries Government Documents Department