Proceedings of the International Joint Conference on Neural Networks
Green, Sam G.; Vineyard, Craig M.; Helinski, Ryan; Koc, Cetin K.
Early neural network architectures were designed by so-called grad student descent. Since then, the field of Neural Architecture Search (NAS) has developed with the goal of algorithmically designing architectures tailored for a dataset of interest. Recently, gradient-based NAS approaches have been created to rapidly perform the search. Gradient-based approaches impose more structure on the search, compared to alternative NAS methods, enabling faster search phase optimization. In the real-world, neural architecture performance is measured by more than just high accuracy. There is increasing need for efficient neural architectures, where resources such as model size or latency must also be considered. Gradient-based NAS is also suitable for such multi-objective optimization. In this work, we extend a popular gradient-based NAS method to support one or more resource costs. We then perform in-depth analysis on the discovery of architectures satisfying single-resource constraints for classification of CIFAR-10.
Remote sensing (RS) data collection capabilities are rapidly evolving hyper-spectrally (sensing more spectral bands), hyper-temporally (faster sampling rates) and hyper-spatially (increasing number of smaller pixels). Accordingly, sensor technologies have outpaced transmission capa- bilities introducing a need to process more data at the sensor. While many sophisticated data processing capabilities are emerging, power and other hardware requirements for these approaches on conventional electronic systems place them out of context for resource constrained operational environments. To address these limitations, in this research effort we have investigated and char- acterized neural-inspired architectures to determine suitability for implementing RS algorithms In doing so, we have been able to highlight a 100x performance per watt improvement using neu- romorphic computing as well as developed an algorithmic architecture co-design and exploration capability.
A hybrid analogue–digital computing system based on memristive devices is capable of solving classic control problems with potentially a lower energy consumption and higher speed than fully digital systems.
Deep neural networks are often computationally expensive, during both the training stage and inference stage. Training is always expensive, because back-propagation requires high-precision floating-pointmultiplication and addition. However, various mathematical optimizations may be employed to reduce the computational cost of inference. Optimized inference is important for reducing power consumption and latency and for increasing throughput. This chapter introduces the central approaches for optimizing deep neural network inference: pruning "unnecessary" weights, quantizing weights and inputs, sharing weights between layer units, compressing weights before transferring from main memory, distilling large high-performance models into smaller models, and decomposing convolutional filters to reduce multiply and accumulate operations. In this chapter, using a unified notation, we provide a mathematical and algorithmic description of the aforementioned deep neural network inference optimization methods.
Deep neural networks (DNN) now outperform competing methods in many academic and industrial domains. These high-capacity universal function approximators have recently been leveraged by deep reinforcement learning (RL) algorithms to obtain impressive results for many control and decision making problems. During the past three years, research toward pruning, quantization, and compression of DNNs has reduced the mathematical, and therefore time and energy, requirements of DNN-based inference. For example, DNN optimization techniques have been developed which reduce storage requirements of VGG-16 from 552MB to 11.3MB, while maintaining the full-model accuracy for image classification. Building from DNN optimization results, the computer architecture community is taking increasing interest in exploring DNN hardware accelerator designs. Based on recent deep RL performance, we expect hardware designers to begin considering architectures appropriate for accelerating these algorithms too. However, it is currently unknown how, when, or if the 'noise' introduced by DNN optimization techniques will degrade deep RL performance. This work measures these impacts, using standard OpenAI Gym benchmarks. Our results show that mathematically optimized RL policies can perform equally to full-precision RL, while requiring substantially less computation. We also observe that different optimizations are better suited than others for different problem domains. By beginning to understand the impacts of mathematical optimizations on RL policy performance, this work serves as a starting point toward the development of low power or high performance deep RL accelerators.
Select one or more publication years and click "Update search results".
This list has already been filtered by scope and author.
SELECTED PUBLICATION YEARS
MATCHING PUBLICATION YEARS
ALL PUBLICATION YEARS
No matches found.
Select a document type
Select one or more document types and click "Update search results".
This list has already been filtered by scope and author.
SELECTED DOCUMENT TYPES
MATCHING DOCUMENT TYPES
ALL DOCUMENT TYPES
No matches found.
Search for an author
Search for a Sandian author by first name, last name, or initials. Click on the author's name to add them as an option, and then click "Update search results".
This list has already been filtered by scope and author.
SELECTED AUTHORS
MATCHING AUTHORS
ALL AUTHORS
No matches found.
Search for a funding sponsor
Search for one or more funding sponsors and click "Update search results".
This list has already been filtered by scope and author.
SELECTED FUNDING SPONSORS
MATCHING FUNDING SPONSORS
ALL FUNDING SPONSORS
No matches found.
Search for a research partner
Search for one or more research partners and click "Update search results".
This list has already been filtered by scope and author.
SELECTED RESEARCH PARTNERS
MATCHING RESEARCH PARTNERS
ALL RESEARCH PARTNERS
No matches found.
Search for a subject
Search for one or more subjects and click "Update search results".
This list has already been filtered by scope and author.