Computational Front End for Creating a Universal Phage Engineering Platform
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Analytical Methods
The discovery of the RNA-guided DNA nuclease CRISPR-Cas9 has enabled the targeted editing of genomes from diverse organisms, but the permanent and inheritable nature of genome modification also poses immense risks. The potential for accidental exposure, malicious use, or undesirable persistence of Cas9 therapeutics and off-target genome effects highlight the need for detection assays. Here we report a centrifugal microfluidic platform for the measurement of both Cas9 protein levels and nuclease activity. Because Cas9 from many bacterial species have been adapted for biotechnology applications, we developed the capability to detect Cas9 from the widely-used S. pyogenes, as well as S. aureus, N. meningitidis, and S. thermophilus using commercially-available antibodies. Further, we show that the phage-derived anti-CRISPR protein AcrIIC1, which binds to Cas9 from several species, can be used as a capture reagent to broaden the species range of detection. As genome modification generally requires Cas9 nuclease activity, a fluorescence-based sedimentation nuclease assay was also incorporated to allow the sensitive and simultaneous measurement of both Cas9 protein and activity in a single biological sample.
Abstract not provided.
Scientific Reports
Emerging sequencing technologies are allowing us to characterize environmental, clinical and laboratory samples with increasing speed and detail, including real-time analysis and interpretation of data. One example of this is being able to rapidly and accurately detect a wide range of pathogenic organisms, both in the clinic and the field. Genomes can have radically different GC content however, such that accurate sequence analysis can be challenging depending upon the technology used. Here, we have characterized the performance of the Oxford MinION nanopore sequencer for detection and evaluation of organisms with a range of genomic nucleotide bias. We have diagnosed the quality of base-calling across individual reads and discovered that the position within the read affects base-calling and quality scores. Finally, we have evaluated the performance of the current state-of-the-art neural network-based MinION basecaller, characterizing its behavior with respect to systemic errors as well as context- and sequence-specific errors. Overall, we present a detailed characterization the capabilities of the MinION in terms of generating high-accuracy sequence data from genomes with a wide range of nucleotide content. This study provides a framework for designing the appropriate experiments that are the likely to lead to accurate and rapid field-forward diagnostics.
Funded through the IHNS/E&HS investment area for FY16-18, the RAPIER LDRD sought to evaluate the potential benefits and applicability of the new Oxford MinION nanopore sequencer to pathogen diagnostic applications in biodefense, biosurveillance, and global/public health. The project had four primary objectives: 1) to investigate the performance of the MinION sequencer while building facility with its operation, 2) to develop microfluidic library prep automation facilitating the use of the MinION in field-forward or point-of-care applications, 3) to leverage CRISPR/Cas9 technology to enable targeted identification of bacterial pathogens, and 4) to capitalize on the real- time data output capabilities of the MinION to enable rapid sequence-based diagnostics. While the rapid evolution of the MinION sequencing technology during the course of the project posed a number of challenges and required a reassessment of initial project priorities, it also provided unique opportunities, notably culminating in our development of the RUBRIC real-time selective sequencing software.
Abstract not provided.
Analytical Chemistry
The RNA-guided DNA nuclease Cas9 is now widely used for the targeted modification of genomes of human cells and various organisms. Despite the extensive use of Clustered Regularly Interspaced Palindromic Repeats (CRISPR) systems for genome engineering and the rapid discovery and engineering of new CRISPR-associated nucleases, there are no high-throughput assays for measuring enzymatic activity. The current laboratory and future therapeutic uses of CRISPR technology have a significant risk of accidental exposure or clinical off-target effects, underscoring the need for therapeutically effective inhibitors of Cas9. Here, we develop a fluorescence assay for monitoring Cas9 nuclease activity and demonstrate its utility with S. pyogenes (Spy), S. aureus (Sau), and C. jejuni (Cje) Cas9. The assay was validated by quantitatively profiling the species specificity of published anti-CRISPR (Acr) proteins, confirming the reported inhibition of Spy Cas9 by AcrIIA4 and Cje Cas9 by AcrIIC1 and no inhibition of Sau Cas9 by either anti-CRISPR. To identify drug-like inhibitors, we performed a screen of 189 606 small molecules for inhibition of Spy Cas9. Of 437 hits (0.2% hit rate), six were confirmed as Cas9 inhibitors in a direct gel electrophoresis secondary assay. The high-throughput nature of this assay makes it broadly applicable for the discovery of additional Cas9 inhibitors or the characterization of Cas9 enzyme variants.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Nucleic Acids Research
Virulence genes on mobile DNAs such as genomic islands (GIs) and plasmids promote bacterial pathogen emergence. Excision is an early step in GI mobilization, producing a circular GI and a deletion site in the chromosome; circular forms are also known for some bacterial insertion sequences (ISs). The recombinant sequence at the junctions of such circles and deletions can be detected sensitively in high-throughput sequencing data, using new computational methods that enable empirical discovery of mobile DNAs. For the rich mobilome of a hospital Klebsiella pneumoniae strain, circularization junctions (CJs) were detected for six GIs and seven IS types. Our methods revealed differential biology of multiple mobile DNAs, imprecision of integrases and transposases, and differential activity among identical IS copies for IS26, ISKpn18 and ISKpn21. Using the resistance of circular dsDNA molecules to exonuclease, internally calibrated with the native plasmids, showed that not all molecules bearing GI CJs were circular. Transpositions were also detected, revealing replicon preference (ISKpn18 prefers a conjugative IncA/C2 plasmid), local action (IS26), regional preferences, selection (against capsule synthesis) and IS polarity inversion. Efficient discovery and global characterization of numerous mobile elements per experiment improves accounting for the new gene combinations that arise in emerging pathogens.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Genome Announcements
Here, we present the draft genome sequence of Burkholderia pseudomallei PHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. The draft genome consists of 39 contigs and is 7,322,181 bp long.
Abstract not provided.
Abstract not provided.
Abstract not provided.
We have implemented a ligand-alignment algorithm into our developed computational pipeline for identifying specificity-determining features (SDFs) in protein-ligand complexes. Given a set of protein-ligand complex structures, the algorithm aligns the complexes by ligand rather than by the C -RMSD or standard approach, providing a single reference frame for extracting SDFs. We anticipate that this ligand-alignment capability will be highly useful for protein function prediction. We already have a database containing > 20 K ligand-protein complex crystal structures taken from the Protein Data Bank. By aligning these proteins to single reference frames using ligand alignment, we can submit the complexes to our pipeline for SDF extraction. The SDFs derived from this training procedure can be used as thumbprints that are hallmarks of individual enzyme classes. These SDF thumbprints may then serve as guides to the prediction of function of new unknown proteins.
Journal of the American Chemical Society
Abstract not provided.
PLoS ONE
Next-generation sequencing (NGS) is emerging as a powerful tool for elucidating genetic information for a wide range of applications. Unfortunately, the surging popularity of NGS has not yet been accompanied by an improvement in automated techniques for preparing formatted sequencing libraries. To address this challenge, we have developed a prototype microfluidic system for preparing sequencer-ready DNA libraries for analysis by Illumina sequencing. Our system combines droplet-based digital microfluidic (DMF) sample handling with peripheral modules to create a fully-integrated, sample-in library-out platform. In this report, we use our automated system to prepare NGS libraries from samples of human and bacterial genomic DNA. E. coli libraries prepared on-device from 5 ng of total DNA yielded excellent sequence coverage over the entire bacterial genome, with >99% alignment to the reference genome, even genome coverage, and good quality scores. Furthermore, we produced a de novo assembly on a previously unsequenced multi-drug resistant Klebsiella pneumoniae strain BAA-2146 (KpnNDM). The new method described here is fast, robust, scalable, and automated. Our device for library preparation will assist in the integration of NGS technology into a wide variety of laboratories, including small research laboratories and clinical laboratories. © 2013 Kim et al.
RNA Biology
Abstract not provided.
Biotechniques
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Nucleic Acid Research
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Sandia's scientific and engineering expertise in the fields of computational biology, high-performance prosthetic limbs, biodetection, and bioinformatics has been applied to specific problems at the forefront of cancer research. Molecular modeling was employed to design stable mutations of the enzyme L-asparaginase with improved selectivity for asparagine over other amino acids with the potential for improved cancer chemotherapy. New electrospun polymer composites with improved electrical conductivity and mechanical compliance have been demonstrated with the promise of direct interfacing between the peripheral nervous system and the control electronics of advanced prosthetics. The capture of rare circulating tumor cells has been demonstrated on a microfluidic chip produced with a versatile fabrication processes capable of integration with existing lab-on-a-chip and biosensor technology. And software tools have been developed to increase the calculation speed of clustered heat maps for the display of relationships in large arrays of protein data. All these projects were carried out in collaboration with researchers at the University of Texas M. D. Anderson Cancer Center in Houston, TX.
Abstract not provided.
We have developed a novel modular automated processing system (MAPS) that enables reliable, high-throughput analysis as well as sample-customized processing. This system is comprised of a set of independent modules that carry out individual sample processing functions: cell lysis, protein concentration (based on hydrophobic, ion-exchange and affinity interactions), interferent depletion, buffer exchange, and enzymatic digestion of proteins of interest. Taking advantage of its unique capacity for enclosed processing of intact bioparticulates (viruses, spores) and complex serum samples, we have used MAPS for analysis of BSL1 and BSL2 samples to identify specific protein markers through integration with the portable microChemLab{trademark} and MALDI.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Abstract not provided.
Bioweapons and emerging infectious diseases pose formidable and growing threats to our national security. Rapid advances in biotechnology and the increasing efficiency of global transportation networks virtually guarantee that the United States will face potentially devastating infectious disease outbreaks caused by novel ('unknown') pathogens either intentionally or accidentally introduced into the population. Unfortunately, our nation's biodefense and public health infrastructure is primarily designed to handle previously characterized ('known') pathogens. While modern DNA assays can identify known pathogens quickly, identifying unknown pathogens currently depends upon slow, classical microbiological methods of isolation and culture that can take weeks to produce actionable information. In many scenarios that delay would be costly, in terms of casualties and economic damage; indeed, it can mean the difference between a manageable public health incident and a full-blown epidemic. To close this gap in our nation's biodefense capability, we will develop, validate, and optimize a system to extract nucleic acids from unknown pathogens present in clinical samples drawn from infected patients. This system will extract nucleic acids from a clinical sample, amplify pathogen and specific host response nucleic acid sequences. These sequences will then be suitable for ultra-high-throughput sequencing (UHTS) carried out by a third party. The data generated from UHTS will then be processed through a new data assimilation and Bioinformatic analysis pipeline that will allow us to characterize an unknown pathogen in hours to days instead of weeks to months. Our methods will require no a priori knowledge of the pathogen, and no isolation or culturing; therefore it will circumvent many of the major roadblocks confronting a clinical microbiologist or virologist when presented with an unknown or engineered pathogen.
Abstract not provided.
Abstract not provided.
We present concepts for self-assembly of diffractive optics with potential uses in biosensors and biocomputers. The simplest such optics, diffraction gratings, can potentially be made from chemically-stabilized microtubules migrating on nanopatterned tracks of the motor protein kinesin. We discuss the fabrication challenges involved in patterning sub-micron-scale structures with proteins that must be maintained in aqueous buffers to preserve their activity. A novel strategy is presented that employs dry contact printing onto glass-supported amino-silane monolayers of heterobifunctional crosslinkers, followed by solid-state reactions of these cross-linkers, to graft patterns of reactive groups onto the surface. Successive solution-phase addition of cysteine-mutant proteins and amine-reactive polyethylene glycol allows assembly of features onto the printed patterns. We present data from initial experiments showing successful micro- and nanopatterning of lines of single-cysteine mutants of kinesin interleaved with lines of polyethylene, indicating that this strategy can be employed to arrays of features with resolutions suitable for gratings.
Intra-molecular cross-linking has been suggested as a method of obtaining distance constraints that would be useful in developing structural models of proteins. Recent work published on intra-molecular cross-linking for protein structural studies has employed commercially available primary amine selective reagents that can cross-link lysine residues to other lysine residues or the amino terminus. Previous work using these cross-linkers has shown that for several proteins of known structure, the number of cross-links that can be obtained experimentally may be small compared to what would be expected from the known structure, due to the relative reactivity, distribution, and solvent accessibility of the lysines in the protein sequence. To overcome these limitations we have investigated the use of cross-linking reagents that can react with other reactive sidechains in proteins. We used 1-Ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC) to activate the carboxylic acid containing residues, aspartic acid (D), glutamic acid (E), and the carboxy terminus (O), for cross-linking reactions. Once activated, the DEO sidechains can react to form 'zero-length' cross-links with nearby primary amine containing resides, lysines (K) and the amino terminus (X), via the formation of a new amide bond. We also show that the EDC-activated DEO sidechains can be cross-linked to each other using dihydrazides, two hydrazide moieties connected by an alkyl cross-linker ann of variable length. Using these reagents, we have found three new 'zero-length' cross-links in ubiquitin consistent with its known structure (M1-E16, M1-E18, and K63-E64). Using the dihydrazide cross-linkers, we have identified 2 new cross-links (D21-D32 and E24-D32) unambiguously. Using a library of dihydrazide cross-linkers with varying arm length, we have shown that there is a minimum arm length required for the DEO-DEO cross-links of 5.8 angstroms. These results show that additional structural information can be obtained by exploiting new cross-linker chemistry, increasing the probability that the protein target of choice will yield sufficient distance constraints to develop a structural model.
Membrane proteins make up a diverse and important subset of proteins for which structural information is limited. In this study, chemical cross-linking and mass spectrometry were used to explore the structure of the G-protein-coupled photoreceptor bovine rhodopsin in the dark-state conformation. All experiments were performed in rod outer segment membranes using amino acid 'handles' in the native protein sequence and thus minimizing perturbations to the native protein structure. Cysteine and lysine residues were covalently cross-linked using commercially available reagents with a range of linker arm lengths. Following chemical digestion of cross-linked protein, cross-linked peptides were identified by accurate mass measurement using liquid chromatography-fourier transform mass spectrometry and an automated data analysis pipeline. Assignments were confirmed and, if necessary, resolved, by tandem MS. The relative reactivity of lysine residues participating in cross-links was evaluated by labeling with NHS-esters. A distinct pattern of cross-link formation within the C-terminal domain, and between loop I and the C-terminal domain, emerged. Theoretical distances based on cross-linking were compared to inter-atomic distances determined from the energy-minimized X-ray crystal structure and Monte Carlo conformational search procedures. In general, the observed cross-links can be explained by re-positioning participating side-chains without significantly altering backbone structure. One exception, between C3 16 and K325, requires backbone motion to bring the reactive atoms into sufficient proximity for cross-linking. Evidence from other studies suggests that residues around K325 for a region of high backbone mobility. These findings show that cross-linking studies can provide insight into the structural dynamics of membrane proteins in their native environment.
Clostridial neurotoxins, such as botulinum and tetanus, are generally thought to invade neural cells through a process of high affinity binding mediated by gangliosides, internalization via endosome formation, and subsequent membrane penetration of the catalytic domain activated by a pH drop in the endosome. This surface recognition and internalization process is still not well understood with regard to what specific membrane features the toxins target, the intermolecular interactions between bound toxins, and the molecular conformational changes that occur as a result of pH lowering. In an effort to elucidate the mechanism of tetanus toxin binding and permeation through the membrane a simple yet representative model was developed that consisted of the ganglioside G{sub tlb} incorporated in a bilayer of cholesterol and DPPC (dipalmitoylphosphatidyl choline). The bilayers were stable over time yet sensitive towards the binding and activity of whole toxin. A liposome leakage study at constant pH as well as with a pH gradient, to mimic the processes of the endosome, was used to elucidate the effect of pH on the toxin's membrane binding and permeation capability. Topographic imaging of the membrane surface, via in situ tapping mode AFM, provided nanoscale characterization of the toxin's binding location and pore formation activity.
We have developed a novel approach to modeling the transmembrane spanning helical bundles of integral membrane proteins using only a sparse set of distance constraints, such as those derived from MS3-D, dipolar-EPR and FRET experiments. Algorithms have been written for searching the conformational space of membrane protein folds matching the set of distance constraints, which provides initial structures for local conformational searches. Local conformation search is achieved by optimizing these candidates against a custom penalty function that incorporates both measures derived from statistical analysis of solved membrane protein structures and distance constraints obtained from experiments. This results in refined helical bundles to which the interhelical loops and amino acid side-chains are added. Using a set of only 27 distance constraints extracted from the literature, our methods successfully recover the structure of dark-adapted rhodopsin to within 3.2 {angstrom} of the crystal structure.
Protein Science
We present a two-step approach to modeling the transmembrane spanning helical bundles of integral membrane proteins using only sparse distance constraints, such as those derived from chemical cross-linking, dipolar EPR and FRET experiments. In Step 1, using an algorithm, we developed, the conformational space of membrane protein folds matching a set of distance constraints is explored to provide initial structures for local conformational searches. In Step 2, these structures refined against a custom penalty function that incorporates both measures derived from statistical analysis of solved membrane protein structures and distance constraints obtained from experiments. We begin by describing the statistical analysis of the solved membrane protein structures from which the theoretical portion of the penalty function was derived. We then describe the penalty function, and, using a set of six test cases, demonstrate that it is capable of distinguishing helical bundles that are close to the native bundle from those that are far from the native bundle. Finally, using a set of only 27 distance constraints extracted from the literature, we show that our method successfully recovers the structure of dark-adapted rhodopsin to within 3.2 Å of the crystal structure.
A deterministic algorithm for enumeration of transmembrane protein folds is presented. Using a set of sparse pairwise atomic distance constraints (such as those obtained from chemical cross-linking, FRET, or dipolar EPR experiments), the algorithm performs an exhaustive search of secondary structure element packing conformations distributed throughout the entire conformational space. The end result is a set of distinct protein conformations, which can be scored and refined as part of a process designed for computational elucidation of transmembrane protein structures.
Abstract not provided.
This report summarizes the development of sensor particles for remote detection of trace chemical analytes over broad areas, e.g residual trinitrotoluene from buried landmines or other unexploded ordnance (UXO). We also describe the potential of the sensor particle approach for the detection of chemical warfare (CW) agents. The primary goal of this work has been the development of sensor particles that incorporate sample preconcentration, analyte molecular recognition, chemical signal amplification, and fluorescence signal transduction within a ''grain of sand''. Two approaches for particle-based chemical-to-fluorescence signal transduction are described: (1) enzyme-amplified immunoassays using biocompatible inorganic encapsulants, and (2) oxidative quenching of a unique fluorescent polymer by TNT.