Dataset - CKAN

Learning a Move-Generator for Upper Confidence Trees

International audience
- HTML
Off-policy Learning in Large-scale POMDP-based Dialogue Systems

International audience
- HTML
Dopaminergic control of the exploration-exploitation trade-off via the basal ...

International audience
- HTML
Robot cognitive control with a neurophysiologically inspired reinforcement le...

International audience
- HTML
Learning what to say and how to say it: joint optimization of spoken dialogue...

International audience
- HTML
Interaction with Machine Improvisation

International audience
- HTML
Finite-Sample Analysis of Least-Squares Policy Iteration

International audience
- HTML
Learning User Preferences in Ubiquitous Systems: A User Study and a Reinforce...

International audience
- HTML
Intrinsic Motivation for Autonomous Mental Development

International audience
- HTML
DS 2012 : Discovery Science

International audience
- HTML
Interactions between the Midbrain Superior Colliculus and the Basal Ganglia.

International audience
- HTML
Off-policy Learning with Eligibility Traces: A Survey

In the framework of Markov Decision Processes, off-policy learning, that is, the problem of learning a linear approximation of the value function of some fixed policy...
- HTML
Adaptative transfer in reinforcement learning : application for simulation of...

A possible way to accelerate the reinforcement learning process is to guide the exploration process using prior domain knowledge. This is called knowledge transfer, and most...
- HTML
Performance Bounds for Lambda Policy Iteration and Application to the Game of...

International audience
- HTML
Feature discovery in reinforcement learning using genetic programming

International audience
- HTML
Basis Expansion in Natural Actor Critic Methods

International audience
- HTML
Reinforcement learning for microgrid energy management

International audience
- HTML
Learning near-optimal policies with Bellman-residual minimization based fitte...

International audience
- HTML
Hypervolume indicator and dominance reward based multi-objective Monte-Carlo ...

International audience
- HTML
Adaptive data collection protocol using reinforcement learning for VANETs

International audience
- HTML

You can also access this registry using the API (see API Docs).
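
As a rough illustration of that API access, the sketch below queries CKAN's `package_search` action, which is part of the standard CKAN Action API (version 3). The base URL `https://ckan.example.org` is a placeholder, since this page does not name the site's address, and the helper names `build_search_url`, `extract_titles` and `search_titles` are hypothetical. The registry's own API Docs remain the authority on which actions and parameters are enabled.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder base URL (assumption): this page does not state the site address.
BASE_URL = "https://ckan.example.org"


def build_search_url(base_url, query, rows=20, start=0):
    """Build a URL for CKAN's package_search action (Action API v3)."""
    params = urlencode({"q": query, "rows": rows, "start": start})
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{params}"


def extract_titles(payload):
    """Return the dataset titles from a decoded package_search response."""
    if not payload.get("success"):
        raise ValueError("CKAN reported an unsuccessful request")
    return [pkg.get("title", "") for pkg in payload["result"]["results"]]


def search_titles(base_url, query, rows=20, start=0):
    """Fetch one page of search results over HTTP and return their titles."""
    with urlopen(build_search_url(base_url, query, rows, start)) as resp:
        return extract_titles(json.load(resp))
```

Calling `search_titles(BASE_URL, "reinforcement learning", rows=20, start=20)` would request the second block of twenty matches, assuming the placeholder is replaced with the real address. The search term is only an example and is not taken from this page.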