Publications

75 Results
Skip to search filters

A statistical approach to combining multisource information in one-class classifiers

Statistical Analysis and Data Mining

Simonson, Katherine M.; West, Roger D.; Hansen, Ross L.; LaBruyere, Thomas E.; Van Benthem, Mark V.

A new method is introduced for combining information from multiple sources to support one-class classification. The contributing sources may represent measurements taken by different sensors of the same physical entity, repeated measurements by a single sensor, or numerous features computed from a single measured image or signal. The approach utilizes the theory of statistical hypothesis testing, and applies Fisher's technique for combining p-values, modified to handle nonindependent sources. Classifier outputs take the form of fused p-values, which may be used to gauge the consistency of unknown entities with one or more class hypotheses. The approach enables rigorous assessment of classification uncertainties, and allows for traceability of classifier decisions back to the constituent sources, both of which are important for high-consequence decision support. Application of the technique is illustrated in two challenge problems, one for skin segmentation and the other for terrain labeling. The method is seen to be particularly effective for relatively small training samples.

More Details

Materials assurance through orthogonal materials measurements: X-ray fluorescence aspects

Powder Diffraction

Rodriguez, Mark A.; Van Benthem, Mark V.; Susan, D.F.; Griego, James J.M.; Yang, Pin Y.; Mowry, Curtis D.; Enos, David E.

X-ray fluorescence (XRF) has been employed as one of several orthogonal means of screening materials to prevent counterfeit and adulterated products from entering the product stream. We document the use of principal component analysis (PCA) of XRF data on compositionally similar and dissimilar stainless steels for the purpose of testing the feasibility of employing XRF spectra to parse and bin these alloys as the same or significantly different alloy materials. The results indicate that XRF spectra can separate and assign alloys via PCA, but that important corrections for detector drift and scaling must be performed in order to achieve valid results.

More Details

Effect of Hydrogen and Oxygen Partial Pressure on the Tribochemistry of Silicon Oxide-Containing Hydrogenated Amorphous Carbon

Mangolini, Filippo M.; Mangolini, Filippo M.; Koshigan, Komlavi D.; Koshigan, Komlavi D.; Van Benthem, Mark V.; Van Benthem, Mark V.; Ohlhausen, J.A.; Ohlhausen, J.A.; McClimon, J B.; McClimon, J B.; Hilbert, James A.; Hilbert, James A.; Fontaine, Julien F.; Fontaine, Julien F.; Carpick, Robert W.; Carpick, Robert W.

Abstract not provided.

Trusted materials using orthogonal testing. 2015 Annual report

Van Benthem, Mark V.

The purpose of this project is to prove (or disprove) that a reasonable number of simple tests can be used to provide a unique data signature for materials, changes in which could serve as a harbinger of material deviation, prompting further evaluations. The routine tests are mutually orthogonal to any currently required materials specification tests.

More Details

An evaluation of algorithms and methods for compressing and decompressing atmospheric transmission data for use in at-sensor measurements

Proceedings of SPIE - The International Society for Optical Engineering

Van Benthem, Mark V.; Woodbury, Drew P.

In this paper, we describe the use of various methods of one-dimensional spectral compression by variable selection as well as principal component analysis (PCA) for compressing multi-dimensional sets of spectral data. We have examined methods of variable selection such as wavelength spacing, spectral derivatives, and spectral integration error. After variable selection, reduced transmission spectra must be decompressed for use. Here we examine various methods of interpolation, e.g., linear, cubic spline and piecewise cubic Hermite interpolating polynomial (PCHIP) to recover the spectra prior to estimating at-sensor radiance. Finally, we compressed multi-dimensional sets of spectral transmittance data from moderate resolution atmospheric transmission (MODTRAN) data using PCA. PCA seeks to find a set of basis spectra (vectors) that model the variance of a data matrix in a linear additive sense. Although MODTRAN data are intricate and are used in nonlinear modeling, their base spectra can be reasonably modeled using PCA yielding excellent results in terms of spectral reconstruction and estimation of at-sensor radiance. The major finding of this work is that PCA can be implemented to compress MODTRAN data with great effect, reducing file size, access time and computational burden while producing high-quality transmission spectra for a given set of input conditions.

More Details

Tensor analysis methods for activity characterization in spatiotemporal data

Haass, Michael J.; Van Benthem, Mark V.; Ochoa, Edward M.

Tensor (multiway array) factorization and decomposition offers unique advantages for activity characterization in spatio-temporal datasets because these methods are compatible with sparse matrices and maintain multiway structure that is otherwise lost in collapsing for regular matrix factorization. This report describes our research as part of the PANTHER LDRD Grand Challenge to develop a foundational basis of mathematical techniques and visualizations that enable unsophisticated users (e.g. users who are not steeped in the mathematical details of matrix algebra and mulitway computations) to discover hidden patterns in large spatiotemporal data sets.

More Details

Multivariate analysis of progressive thermal desorption coupled gas chromatography-mass spectrometry

Van Benthem, Mark V.; Borek, Theodore T.; Mowry, Curtis D.; Kotula, Paul G.

Thermal decomposition of poly dimethyl siloxane compounds, Sylgard{reg_sign} 184 and 186, were examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a method of producing multiway data using a stepped thermal desorption. The technique involves sequentially heating a sample of the material of interest with subsequent analysis in a commercial GC/MS system. The decomposition chromatograms were analyzed using multivariate analysis tools including principal component analysis (PCA), factor rotation employing the varimax criterion, and multivariate curve resolution. The results of the analysis show seven components related to offgassing of various fractions of siloxanes that vary as a function of temperature. Thermal desorption coupled with gas chromatography-mass spectrometry (TD/GC-MS) is a powerful analytical technique for analyzing chemical mixtures. It has great potential in numerous analytic areas including materials analysis, sports medicine, in the detection of designer drugs; and biological research for metabolomics. Data analysis is complicated, far from automated and can result in high false positive or false negative rates. We have demonstrated a step-wise TD/GC-MS technique that removes more volatile compounds from a sample before extracting the less volatile compounds. This creates an additional dimension of separation before the GC column, while simultaneously generating three-way data. Sandia's proven multivariate analysis methods, when applied to these data, have several advantages over current commercial options. It also has demonstrated potential for success in finding and enabling identification of trace compounds. Several challenges remain, however, including understanding the sources of noise in the data, outlier detection, improving the data pretreatment and analysis methods, developing a software tool for ease of use by the chemist, and demonstrating our belief that this multivariate analysis will enable superior differentiation capabilities. In addition, noise and system artifacts challenge the analysis of GC-MS data collected on lower cost equipment, ubiquitous in commercial laboratories. This research has the potential to affect many areas of analytical chemistry including materials analysis, medical testing, and environmental surveillance. It could also provide a method to measure adsorption parameters for chemical interactions on various surfaces by measuring desorption as a function of temperature for mixtures. We have presented results of a novel method for examining offgas products of a common PDMS material. Our method involves utilizing a stepped TD/GC-MS data acquisition scheme that may be almost totally automated, coupled with multivariate analysis schemes. This method of data generation and analysis can be applied to a number of materials aging and thermal degradation studies.

More Details

In situ analysis of LiFePO4 batteries: Signal extraction by multivariate analysis

Powder Diffraction

Rodriguez, Marko A.; Van Benthem, Mark V.; Ingersoll, David I.; Vogel, Sven C.; Reiche, Helmut M.

The electrochemical reaction behavior of a commercial Li-ion battery (FePO4-based cathode, graphite-based anode) has been measured via in situ neutron diffraction. A multivariate analysis was successfully applied to the neutron diffraction data set facilitating in the determination of Li bearing phases participating in the electrochemical reaction in both the anode and cathode as a function of state-of-charge (SOC). The analysis resulted in quantified phase fraction values for FePO4 and FePO4 cathode compounds as well as the identification of staging behavior of Li6, Li12, Li24, and graphite phases in the anode. An additional Li-graphite phase has also been tentatively identified during electrochemical cycling as LiC48 at conditions of ∼5% to 15% SOC. © 2010 International Centre for Diffraction Data.

More Details

Final report : multicomponent forensic signature development : interactions with common textiles; mustard precursors and simulants

Van Benthem, Mark V.; Borek, Theodore T.; Mowry, Curtis D.; Kotula, Paul G.

2-Chloroethyl phenyl sulfide (CEPS), a surrogate compound of the chemical warfare agent sulfur mustard, was examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a novel method of producing multiway data using a stepped thermal desorption. Various multivariate analysis schemes were employed to analyze the data. These methods may be able to discern different sources of CEPS. In addition, CEPS was applied to cotton, nylon, polyester, and silk swatches. These swatches were placed in controlled humidity chambers maintained at 23%, 56%, and 85% relative humidity. At regular intervals, samples were removed from each test swatch, and the samples analyzed using TD/GC-MS. The results were compared across fabric substrate and humidity.

More Details

Hyperspectral imaging of oil producing microalgae under thermal and nutritional stress

Powell, Amy J.; Davis, Ryan W.; Lane, Todd L.; Lane, Pamela L.; Keenan, Michael R.; Van Benthem, Mark V.

This short-term, late-start LDRD examined the effects of nutritional deprivation on the energy harvesting complex in microalgae. While the original experimental plan involved a much more detailed study of temperature and nutrition on the antenna system of a variety of TAG producing algae and their concomitant effects on oil production, time and fiscal constraints limited the scope of the study. This work was a joint effort between research teams at Sandia National Laboratories, New Mexico and California. Preliminary results indicate there is a photosystem response to silica starvation in diatoms that could impact the mechanisms for lipid accumulation.

More Details

Weighting hyperspectral image data for improved multivariate curve resolution results

Journal of Chemometrics

Jones, Howland D.; Haaland, David M.; Sinclair, Michael B.; Melgaard, David K.; Van Benthem, Mark V.; Pedroso, M.C.

The combination of hyperspectral confocal fluorescence microscopy and multivariate curve resolution (MCR) provides an ideal system for improved quantitative imaging when multiple fluorophores are present. However, the presence of multiple noise sources limits the ability of MCR to accurately extract pure-component spectra when there is high spectral and/or spatial overlap between multiple fluorophores. Previously, MCR results were improved by weighting the spectral images for Poisson-distributed noise, but additional noise sources are often present. We have identified and quantified all the major noise sources in hyperspectral fluorescence images. Two primary noise sources were found: Poisson-distributed noise and detector-read noise. We present methods to quantify detector-read noise variance and to empirically determine the electron multiplying CCD (EMCCD) gain factor required to compute the Poisson noise variance. We have found that properly weighting spectral image data to account for both noise sources improved MCR accuracy. In this paper, we demonstrate three weighting schemes applied to a real hyperspectral corn leaf image and to simulated data based upon this same image. MCR applied to both real and simulated hyperspectral images weighted to compensate for the two major noise sources greatly improved the extracted pure emission spectra and their concentrations relative to MCR with either unweighted or Poisson-only weighted data. Thus, properly identifying and accounting for the major noise sources in hyperspectral images can serve to improve the MCR results. These methods are very general and can be applied to the multivariate analysis of spectral images whenever CCD or EMCCD detectors are used. Copyright © 2008 John Wiley & Sons, Ltd.

More Details

3D optical sectioning with a new hyperspectral confocal fluorescence imaging system

Haaland, David M.; Sinclair, Michael B.; Jones, Howland D.; Timlin, Jerilyn A.; Bachand, George B.; Sasaki, Darryl Y.; Davidson, George S.; Van Benthem, Mark V.

A novel hyperspectral fluorescence microscope for high-resolution 3D optical sectioning of cells and other structures has been designed, constructed, and used to investigate a number of different problems. We have significantly extended new multivariate curve resolution (MCR) data analysis methods to deconvolve the hyperspectral image data and to rapidly extract quantitative 3D concentration distribution maps of all emitting species. The imaging system has many advantages over current confocal imaging systems including simultaneous monitoring of numerous highly overlapped fluorophores, immunity to autofluorescence or impurity fluorescence, enhanced sensitivity, and dramatically improved accuracy, reliability, and dynamic range. Efficient data compression in the spectral dimension has allowed personal computers to perform quantitative analysis of hyperspectral images of large size without loss of image quality. We have also developed and tested software to perform analysis of time resolved hyperspectral images using trilinear multivariate analysis methods. The new imaging system is an enabling technology for numerous applications including (1) 3D composition mapping analysis of multicomponent processes occurring during host-pathogen interactions, (2) monitoring microfluidic processes, (3) imaging of molecular motors and (4) understanding photosynthetic processes in wild type and mutant Synechocystis cyanobacteria.

More Details

Multivariate curve resolution for hyperspectral image analysis: Applications to microarray technology

Proceedings of SPIE - The International Society for Optical Engineering

Haaland, David M.; Timlin, Jerilyn A.; Sinclair, Michael B.; Van Benthem, Mark V.; Martinez, M.J.; Aragon, Anthony D.; Werner-Washburne, Margaret

Multivariate curve resolution (MCR) using constrained alternating least squares algorithms represents a powerful analysis capability for the quantitative analysis of hyperspectral image data. We will demonstrate the application of MCR using data from a new hyperspectral fluorescence imaging microarray scanner for monitoring gene expression in cells from thousands of genes on the array. The new scanner collects the entire fluorescence spectrum from each pixel of the scanned microarray. Application of MCR with nonnegativity and equality constraints reveals several sources of undesired fluorescence that emit in the same wavelength range as the reporter fluorophores. MCR analysis of the hyperspectral images confirms that one of the sources of fluorescence is due to contaminant fluorescence under the printed DNA spots that is spot localized. Thus, traditional background subtraction methods used with data collected from the current commercial microarray scanners will lead to errors in determining the relative expression of low-expressed genes. With the new scanner and MCR analysis, we generate relative concentration maps of the background, impurity, and fluorescent labels over the entire image. Since the concentration maps of the fluorescent labels are relatively unaffected by the presence of background and impurity emissions, the accuracy and useful dynamic range of the gene expression data are both greatly improved over those obtained by commercial microarray scanners.

More Details

Reducing System Artifacts in Hyperspectral Image Data Analysis with the Use of Estimates of the Error Covariance in the Data

Haaland, David M.; Van Benthem, Mark V.

Hyperspectral Fourier transform infrared images have been obtained from a neoprene sample aged in air at elevated temperatures. The massive amount of spectra available from this heterogeneous sample provides the opportunity to perform quantitative analysis of the spectral data without the need for calibration standards. Multivariate curve resolution (MCR) methods with non-negativity constraints applied to the iterative alternating least squares analysis of the spectral data has been shown to achieve the goal of quantitative image analysis without the use of standards. However, the pure-component spectra and the relative concentration maps were heavily contaminated by the presence of system artifacts in the spectral data. We have demonstrated that the detrimental effects of these artifacts can be minimized by adding an estimate of the error covariance structure of the spectral image data to the MCR algorithm. The estimate is added by augmenting the concentration and pure-component spectra matrices with scores and eigenvectors obtained from the mean-centered repeat image differences of the sample. The implementation of augmentation is accomplished by employing efficient equality constraints on the MCR analysis. Augmentation with the scores from the repeat images is found to primarily improve the pure-component spectral estimates while augmentation with the corresponding eigenvectors primarily improves the concentration maps. Augmentation with both scores and eigenvectors yielded the best result by generating less noisy pure-component spectral estimates and relative concentration maps that were largely free from a striping artifact that is present due to system errors in the FT-IR images. The MCR methods presented are general and can also be applied productively to non-image spectral data.

More Details
75 Results
75 Results