Dataset - CKAN

Off-policy Learning with Eligibility Traces: A Survey

In the framework of Markov Decision Processes, off-policy learning, that is the problem of learning a linear approximation of the value function of some fixed policy...
- HTML
Learning near-optimal policies with Bellman-residual minimization based fitte...

International audience
- HTML

You can also access this registry using the API (see API Docs).

2 datasets found