


8th EWRL 2008: Villeneuve d'Ascq, France
- Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko:
  Recent Advances in Reinforcement Learning, 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30 - July 3, 2008, Revised and Selected Papers. Lecture Notes in Computer Science 5323, Springer 2008, ISBN 978-3-540-89721-7
- Boris Defourny, Damien Ernst, Louis Wehenkel:
  Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees. 1-14
- Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin:
  Exploiting Additive Structure in Factored MDPs for Reinforcement Learning. 15-26
- Christos Dimitrakakis, Michail G. Lagoudakis:
  Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration. 27-40
- Kirill Dyagilev, Shie Mannor, Nahum Shimkin:
  Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. 41-54
- Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor:
  Regularized Fitted Q-Iteration: Application to Planning. 55-68
- Sarah Filippi, Olivier Cappé, Fabrice Clérot, Eric Moulines:
  A Near Optimal Policy for Channel Allocation in Cognitive Radio. 69-81
- Thomas Gabel, Martin A. Riedmiller:
  Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets. 82-95
- Matthieu Geist, Olivier Pietquin, Gabriel Fricout:
  Bayesian Reward Filtering. 96-109
- Sertan Girgin, Philippe Preux:
  Basis Expansion in Natural Actor Critic Methods. 110-123
- Robby Goetschalckx, Scott Sanner, Kurt Driessens:
  Reinforcement Learning with the Use of Costly Features. 124-135
- Verena Heidrich-Meisner, Christian Igel:
  Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem. 136-150
- Jean-François Hren, Rémi Munos:
  Optimistic Planning of Deterministic Systems. 151-164
- Yuxi Li, Dale Schuurmans:
  Policy Iteration for Learning an Exercise Policy for American Options. 165-178
- Daniele Loiacono, Pier Luca Lanzi:
  Tile Coding Based on Hyperplane Tiles. 179-190
- José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, José Rafael Magdalena Benedicto, Juan Gómez-Sanchís:
  Use of Reinforcement Learning in Two Real Applications. 191-204
- Francis Maes, Ludovic Denoyer, Patrick Gallinari:
  Applications of Reinforcement Learning to Structured Prediction. 205-219
- Jan Peters, Jens Kober, Duy Nguyen-Tuong:
  Policy Learning - A Unified Perspective with Applications in Robotics. 220-228
- Carl Edward Rasmussen, Marc Peter Deisenroth:
  Probabilistic Inference for Fast Learning in Control. 229-242
- Noel Welsh, Jeremy L. Wyatt:
  United We Stand: Population Based Methods for Solving Unknown POMDPs. 243-252
- Huizhen Yu, Dimitri P. Bertsekas:
  New Error Bounds for Approximations from Projected Linear Equations. 253-267
- Jia Yuan Yu, Shie Mannor, Nahum Shimkin:
  Markov Decision Processes with Arbitrary Reward Processes. 268-281















