Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

International audience

Data and Resources

Additional Info

Field Value
Source ISSN: 0885-6125
Author Antos, Andras, Szepesvari, Csaba, Munos, Rémi
Maintainer CCSD
Last Updated May 10, 2026, 20:59 (UTC)
Created May 10, 2026, 20:59 (UTC)
Identifier hal-00830201
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Computer and Automation Research Institute [Budapest] (MTA SZTAKI)
creator Antos, Andras
date 2008-05-10T00:00:00
harvest_object_id 67e06185-accf-4f24-8fb0-0f16b9d6319d
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-10-28T00:00:00
relation info:eu-repo/semantics/altIdentifier/doi/10.1007/s10994-007-5038-2
set_spec type:ART