New approaches to preventing and treating infections, particularly of the respiratory tract, are needed. One promising strategy is to reconfigure microbial communities (microbiomes) within the host to improve defense against pathogens. Probiotics and prebiotics for gastrointestinal (GI) infections offer a template for success. We sought to develop comparable countermeasures for respiratory infections. First, we characterized interactions between the airway microbiome and a biodefense-related respiratory pathogen ( Burkholderia thailandensis ; Bt), using a mouse model of infection. Then, we recovered microbiome constituents from the airway and assessed their ability to re-colonize the airway and protect against respiratory Bt infection. We found that microbiome constituents belonging to Bacillus and related genuses frequently displayed colonization and anti-Bt activity. Comparative growth requirement profiling of these Bacillus strains vs Bt enabled identification of candidate prebiotics. This work serves as proof of concept for airway probiotics, as well as a strong foundation for development of airway prebiotics.
This report describes research conducted to use data science and machine learning methods to distinguish targeted genome editing versus natural mutation and sequencer machine noise. Genome editing capabilities have been around for more than 20 years, and the efficiencies of these techniques has improved dramatically in the last 5+ years, notably with the rise of CRISPR-Cas technology. Whether or not a specific genome has been the target of an edit is concern for U.S. national security. The research detailed in this report provides first steps to address this concern. A large amount of data is necessary in our research, thus we invested considerable time collecting and processing it. We use an ensemble of decision tree and deep neural network machine learning methods as well as anomaly detection to detect genome edits given either whole exome or genome DNA reads. The edit detection results we obtained with our algorithms tested against samples held out during training of our methods are significantly better than random guessing, achieving high F1 and recall scores as well as with precision overall.
We present the draft genome sequences of three Burkholderia thailandensis strains, E421, E426, and DW503. E421 consists of 90 contigs of 6,639,935 bp and 67.73% GC content. E426 consists of 106 contigs of 6,587,853 bp and 67.73% GC content. DW503 consists of 102 contigs of 6,458,767 bp and 67.64% GC content.
Petrov, Anton I.; Kay, Simon J.E.; Kalvari, Ioanna; Howe, Kevin L.; Gray, Kristian A.; Bruford, Elspeth A.; Kersey, Paul J.; Cochrane, Guy; Finn, Robert D.; Bateman, Alex; Kozomara, Ana; Griffiths-Jones, Sam; Frankish, Adam; Zwieb, Christian W.; Lau, Britney Y.; Williams, Kelly P.; Chan, Patricia P.; Lowe, Todd M.; Cannone, Jamie J.; Gutell, Robin R.; Machnicka, Magdalena A.; Bujnicki, Janusz M.; Yoshihama, Maki; Kenmochi, Naoya; Chai, Benli; Cole, James R.; Szymanski, Maciej; Karlowski, Wojciech M.; Wood, Valerie; Huala, Eva; Berardini, Tanya Z.; Zhao, Yi; Chen, Runsheng; Zhu, Weimin; Paraskevopoulou, Maria D.; Vlachos, Ioannis S.; Hatzigeorgiou, Artemis G.; Ma, Lina; Zhang, Zhang; Puetz, Joern; Stadler, Peter F.; McDonald, Daniel; Basu, Siddhartha; Fey, Petra; Engel, Stacia R.; Cherry, J.M.; Volders, Pieter J.; Mestdagh, Pieter; Wower, Jacek; Clark, Michael; Quek, Xiu C.; Dinger, Marcel E.
RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. The website has been subject to continuous improvements focusing on text and sequence similarity searches as well as genome browsing functionality. All RNAcentral data is provided for free and is available for browsing, bulk downloads, and programmatic access at http://rnacentral.org/.
When analyzing pathogen transcriptomes during the infection of host cells, the signal-to-background (pathogen-to-host) ratio of nucleic acids (NA) in infected samples is very small. Despite the advancements in next-generation sequencing, the minute amount of pathogen NA makes standard RNA-seq library preps inadequate for effective gene-level analysis of the pathogen in cases with low bacterial loads. In order to provide a more complete picture of the pathogen transcriptome during an infection, we developed a novel pathogen enrichment technique, which can enrich for transcripts from any cultivable bacteria or virus, using common, readily available laboratory equipment and reagents. To evenly enrich for pathogen transcripts, we generate biotinylated pathogen-targeted capture probes in an enzymatic process using the entire genome of the pathogen as a template. The capture probes are hybridized to a strand-specific cDNA library generated from an RNA sample. The biotinylated probes are captured on a monomeric avidin resin in a miniature spin column, and enriched pathogen-specific cDNA is eluted following a series of washes. To test this method, we performed an in vitro time-course infection using Klebsiella pneumoniae to infect murine macrophage cells. K. pneumoniae transcript enrichment efficiency was evaluated using RNA-seq. Bacterial transcripts were enriched up to ∼400-fold, and allowed the recovery of transcripts from ∼2000-3600 genes not observed in untreated control samples. These additional transcripts revealed interesting aspects of K. pneumoniae biology including the expression of putative virulence factors and the expression of several genes responsible for antibiotic resistance even in the absence of drugs.
Virulence genes on mobile DNAs such as genomic islands (GIs) and plasmids promote bacterial pathogen emergence. Excision is an early step in GI mobilization, producing a circular GI and a deletion site in the chromosome; circular forms are also known for some bacterial insertion sequences (ISs). The recombinant sequence at the junctions of such circles and deletions can be detected sensitively in high-throughput sequencing data, using new computational methods that enable empirical discovery of mobile DNAs. For the rich mobilome of a hospital Klebsiella pneumoniae strain, circularization junctions (CJs) were detected for six GIs and seven IS types. Our methods revealed differential biology of multiple mobile DNAs, imprecision of integrases and transposases, and differential activity among identical IS copies for IS26, ISKpn18 and ISKpn21. Using the resistance of circular dsDNA molecules to exonuclease, internally calibrated with the native plasmids, showed that not all molecules bearing GI CJs were circular. Transpositions were also detected, revealing replicon preference (ISKpn18 prefers a conjugative IncA/C2 plasmid), local action (IS26), regional preferences, selection (against capsule synthesis) and IS polarity inversion. Efficient discovery and global characterization of numerous mobile elements per experiment improves accounting for the new gene combinations that arise in emerging pathogens.
Carney, Laura T.; Wilkenfeld, Joshua S.; Lane, Pamela L.; Solberg, Owen D.; Fuqua, Zachary B.; Cornelius, Nina G.; Gillespie, Shaunette; Williams, Kelly P.; Samocha, Tzachi M.; Lane, Todd L.
Productivity of algal mass culture can be severely reduced by contaminating organisms. It is, therefore, important to identify contaminants, determine their effect on productivity and, ultimately, develop countermeasures against such contamination. In the present study we utilized microbiome analysis by second-generation sequencing of small subunit rRNA genes to characterize the predator and pathogen burden of open raceway cultures of Nannochloropsis salina. Samples were analyzed from replicate raceways before and after crashes. In one culture cycle, we identified two algivorous species, the rotifer Brachionus and gastrotrich Chaetonotus, the presence of which may have contributed to the loss of algal biomass. In the second culture cycle, the raceways were treated with hypochlorite in an unsuccessful attempt to interdict the crash. Our analyses were shown to be an effective strategy for the identification of the biological contaminants and the characterization of intervention strategies.
Here, we present the draft genome sequence of Burkholderia pseudomallei PHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. The draft genome consists of 39 contigs and is 7,322,181 bp long.
Yersinia enterocolitica is typically considered an extracellular pathogen; however, during the course of an infection, a significant number of bacteria are stably maintained within host cell vacuoles. Little is known about this population and the role it plays during an infection. To address this question and to elucidate the spatially and temporally dynamic gene expression patterns of Y. enterocoliticabiovar 1B through the course of an in vitro infection, transcriptome sequencing and differential gene expression analysis of bacteria infecting murine macrophage cells were performed under four distinct conditions. Bacteria were first grown in a nutrient-rich medium at 26°C to establish a baseline of gene expression that is unrelated to infection. The transcriptomes of these bacteria were then compared to bacteria grown in a conditioned cell culture medium at 37°C to identify genes that were differentially expressed in response to the increased temperature and medium but not in response to host cells. Infections were then performed, and the transcriptomes of bacteria found on the extracellular surface and intracellular compartments were analyzed individually. The upregulated genes revealed potential roles for a variety of systems in promoting intracellular virulence, including the Ysa type III secretion system, the Yts2 type II secretion system, and the Tad pilus. It was further determined that mutants of each of these systems had decreased virulence while infecting macrophages. Overall, these results reveal the complete set of genes expressed by Y. enterocolitica in response to infection and provide the groundwork for future virulence studies.
The transfer-messenger RNA (tmRNA) and its partner protein SmpB act together in resolving problems arising when translating bacterial ribosomes reach the end of mRNA with no stop codon. Their genes have been found in nearly all bacterial genomes and in some organelles. The tmRNA Website serves tmRNA sequences, alignments and feature annotations, and has recently moved to http://bioinformatics.sandia.gov/tmrna/. New features include software used to find the sequences, an update raising the number of unique tmRNA sequences from 492 to 1716, and a database of SmpB sequences which are served along with the tmRNA sequence from the same organism.