-
Learning a Move-Generator for Upper Con dence Trees
International audience -
Off-policy Learning in Large-scale POMDP-based Dialogue Systems
International audience -
Dopaminergic control of the exploration-exploitation trade-off via the basal ...
International audience -
Robot cognitive control with a neurophysiologically inspired reinforcement le...
International audience -
Learning what to say and how to say it: joint optimization of spoken dialogue...
International audience -
Interaction with Machine Improvisation
International audience -
Finite-Sample Analysis of Least-Squares Policy Iteration
International audience -
Learning User Preferences in Ubiquitous Systems: A User Study and a Reinforce...
International audience -
Intrinsic Motivation for Autonomous Mental Development
International audience -
DS 2012 : Discovery Science
International audience -
Interactions between the Midbrain Superior Colliculus and the Basal Ganglia.
International audience -
Off-policy Learning with Eligibility Traces: A Survey
In the framework of Markov Decision Processes, off-policy learning, that is the problem of learning a linear approximation of the value function of some fixed policy... -
Adaptative transfer in reinforcement learning : application for simulation of...
A possible way to accelerate reinforcement learning process is to guide the exploration process using prior domain knowledge. This called knowledge transfer, and most... -
Performance Bounds for Lambda Policy Iteration and Application to the Game of...
International audience -
Feature discovery in reinforcement learning using genetic programming
International audience -
Basis Expansion in Natural Actor Critic Methods
International audience -
Reinforcement learning for microgrid energy management
International audience -
Learning near-optimal policies with Bellman-residual minimization based fitte...
International audience -
Hypervolume indicator and dominance reward based multi-objective Monte-Carlo ...
International audience -
Adaptive data collection protocol using reinforcement learning for VANETs
International audience
