Publications

4 Results
Skip to search filters

Hyperparameter Setting for a Marked Multidimensional Hawkes Process with Dissimilar Decays

Sena, Mary R.; Jones, Jessica L.

We provide further details for using Lim, et al.'s marked multidimensional Hawkes processes with dissimilar decays. We first describe what makes these different from other Hawkes processes, then describe each model hyperparameter and how to initialize it informed by the input data and any prior biases. We derived tighter bounds than Lim, et al. for faster convergence of rejection sampling. The resulting hyperparameters and bounds have been helpful against both synthetic and real-world datasets.

More Details

Utilizing reinforcement learning to continuously improve a primitive-based motion planner

AIAA Scitech 2021 Forum

Goddard, Zachary C.; Wardlaw, Kenneth; Krishnan, Rohith; Tsiotras, Panagiotis; Smith, Michael R.; Sena, Mary R.; Parish, Julie M.; Mazumdar, Anirban

This paper describes how the performance of motion primitive based planning algorithms can be improved using reinforcement learning. Specifically, we describe and evaluate a framework for policy improvement via the discovery of new motion primitives. Our approach combines the predictable behavior of deterministic planning methods with the exploration capability of reinforcement learning. The framework consists of three phases: evaluation, exploration, and extraction. This framework can be iterated continuously to provide successive improvement. The evaluation step scores the performance of a motion primitive library using value iteration to create a cost map. A local difference metric is then used to identify regions that need improvement. The exploration step utilizes reinforcement learning to examine new trajectories in the regions of greatest need. The extraction step encodes the agent’s experiences into new primitives. The framework is tested on a point-to-point navigation task using a 6DOF nonlinear F-16 model. One iteration of the framework discovered 17 new primitives and provided a maximum planning time reduction of 96.91%. After 3 full iterations, 123 primitives were added with a maximum time reduction of 97.39%. The proposed framework is easily extensible to a range of vehicles, environments, and cost functions.

More Details
4 Results
4 Results