Safe Approximate Dynamic Programming via Kernelized Lipschitz Estimation.

08:00 EDT 19th March 2020 | BioPortfolio

Summary of "Safe Approximate Dynamic Programming via Kernelized Lipschitz Estimation."

We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ the kernelized Lipschitz estimation to learn multiplier matrices that are used in semidefinite programming frameworks for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initialization and constraint enforcement while providing exponential stability of the equilibrium of the closed-loop system.


Journal Details

This article was published in the following journal.

Name: IEEE transactions on neural networks and learning systems
ISSN: 2162-2388


DeepDyve research library

PubMed Articles [7242 Associated PubMed Articles listed on BioPortfolio]

Robust min-max optimal control design for systems with uncertain models: A neural dynamic programming approach.

The design of an artificial neural network (ANN) based sub-optimal controller to solve the finite-horizon optimization problem for a class of systems with uncertainties is the main outcome of this stu...

Programmatic mapping and size estimation of key populations to inform HIV programming in Tanzania.

A programmatic mapping and size estimation study was conducted in 24 districts in 5 regions of Tanzania to estimate the size and locations of female sex workers (FSW) and men who have sex with men (MS...

Approximate Kernel Selection via Matrix Approximation.

Kernel selection is of fundamental importance for the generalization of kernel methods. This article proposes an approximate approach for kernel selection by exploiting the approximability of kernel s...

Convergence Analysis of Saturated Iterative Learning Control Systems With Locally Lipschitz Nonlinearities.

In this article, the robust trajectory tracking problem of iterative learning control (ILC) for uncertain nonlinear systems is considered, and the effects from locally Lipschitz nonlinearities, input ...

One-slice CT image based kernelized radiomics model for the prediction of low/mid-grade and high-grade HNSCC.

An accurate grade prediction can help to appropriate treatment strategy and effective diagnosis to Head and neck squamous cell carcinoma (HNSCC). Radiomics has been studied for the prediction of carci...

Clinical Trials [4532 Associated Clinical Trials listed on BioPortfolio]

Programming Cochlear Implant With Artificial Intelligence

This thesis project proposes to investigate the "state of the art" of the programming of the cochlear implant. In the center of audiophonologie Brussels, the classic 'manual programming' h...

Medtronic Shock-Less Study

The purpose of this clinical trial is to determine whether periodic therapy programming reports illustrating physician usage of shock reduction programming can increase utilization of reco...

Japanese Assessment of Indication Based Programming

The purpose of this clinical investigation is to assess the programming behavior of the physicians in the patient cohort. Furthermore, the acceptance level of specific programming recommen...

Automated Deep Brain Stimulation Programming Using Functional Mapping

Participants will undergo DBS programming guided by the DBS-Expert system and by a clinician per standard care.

The Safety of Remote DBS Programming System

Remote programming has significant advantages over conventional programming methods on some issues. This research will test the safety of SceneRay remote and wireless DBS programming syste...

Medical and Biotech [MESH] Definitions

A set of models of how communication impacts and is impacted by subjective experience. Techniques are generated from these models by sequencing of various aspects of the models in order to change someone's internal representations. Neurolinguistic programming is concerned with the patterns or programming created by the interactions among the brain, language, and the body, that produce both effective and ineffective behavior.

A technique of operations research for solving certain kinds of problems involving many variables where a best value or set of best values is to be found. It is most likely to be feasible when the quantity to be optimized, sometimes called the objective function, can be stated as a mathematical expression in terms of the various activities within the system, and when this expression is simply proportional to the measure of the activities, i.e., is linear, and when all the restrictions are also linear. It is different from computer programming, although problems using linear programming techniques may be programmed on a computer.

Specific languages used to prepare computer programs.

A structural subclass of streptogramins that includes molecules containing unsaturated macrocyclic lactones with approximate molecular weight of 500 KDa.

The highest dosage administered that does not produce toxic effects. The NOAEL will depend on how closely dosages are spaced (lowest-observed-adverse-effect level and no-observed-effect level) and the number of animals examined. The ultimate objective is usually to determine not the "safe" dosage in laboratory animals but the "safe" dosage for humans. Therefore, the extrapolation most often required of toxicologists is from high-dosage studies in laboratory animals to low doses in humans. (Casarett and Doull's Toxicology: The Basic Science of Poisons, 4th ed)

Quick Search

DeepDyve research library

Searches Linking to this Article