Behavior adaptation by means of reinforcement learning

Carreras, M.; El-fakdi, A.; Ridao, P.

doi:/10.1007/978-1-4614-5659-9_7

Zoeken

Zoeken kan via de modus 'eenvoudig zoeken' (één veld) of uitgebreid via 'geavanceerd zoeken' (meerdere velden). Zo kan je bv. zoeken op een combinatie van een auteursnaam (auteur), een jaartal (jaar) en een documenttype.

Boekenmand

Nuttige resultaten kan je aanvinken en toevoegen aan een mandje. De inhoud hiervan kan je exporteren of afdrukken (naar bv. PDF).

RSS

Op de hoogte blijven van nieuw toegevoegde publicaties binnen uw interessegebied? Dit kan door een RSS-feed (?) te maken van jouw zoekopdracht.

log in

Publicaties | Kaarten

nieuwe zoekopdracht

[ meld een fout in dit record ]

mandje (0): toevoegen | toon

Behavior adaptation by means of reinforcement learning

Carreras, M.; El-fakdi, A.; Ridao, P. (2012). Behavior adaptation by means of reinforcement learning, in: Seto, M.L. (2013). Marine robot autonomy. pp. 287-328. https://dx.doi.org/10.1007/978-1-4614-5659-9_7

In: Seto, M.L. (2013). Marine robot autonomy. Springer Science+Business Media, Inc: New York. ISBN 978-1-4614-5658-2. 382 pp. https://dx.doi.org/10.1007/978-1-4614-5659-9

Beschikbaar in	Auteurs
VLIZ [ aanvragen ]

Author keywords

Optimal Policy, Reinforcement Learning, Autonomous Underwater Vehicle, Learning Sample, Future Reward

Auteurs		Top
Carreras, M. El-fakdi, A. Ridao, P.

Abstract

Machine learning techniques can be used for learning the action-decision problem that most autonomous robots have when working in unknown and changing environments. Reinforcement learning (RL) offers the possibility of learning a state-action policy that solves a particular task without any previous experience. A reinforcement function, designed by a human operator, is the only required information to determine, after some experimentation, the way of solving the task. This chapter proposes the use of RL algorithms to learn reactive AUV behaviors and therefore not having to define the state-action mapping to solve the task. The algorithms will find the policy that optimizes the task and will adapt to any environment dynamics encountered. The advantage of the approach is that the same algorithms can be applied to a range of tasks, assuming that the problem is correctly sensed and defined. The two main methodologies that have been applied in RL-based robot learning for the past 2 decades, value-function methods and policy gradient methods, are presented in this chapter and evaluated in two AUV tasks. In both cases, a well-known theoretical algorithm has been modified to fulfill the requirements of the AUV task and has been applied with a real AUV. Results show the effectiveness of both approaches, each of them with some advantages and disadvantages, and point out the further investigation of these methods for making AUVs perform more robustly and adaptively in future applications.

Alle informatie in het Integrated Marine Information System (IMIS) valt onder het VLIZ Privacy beleid

Top | Auteurs

IMIS is ontwikkeld en wordt gehost door het VLIZ.

Catalogus

Waterbouwkundig Laboratorium Hoofdkantoor

Subscribe to our newsletter

FLANDERS HYDRAULICS

MARITIME TECHNOLOGY DIVISION

U bent hier

Catalogus

Waterbouwkundig Laboratorium Hoofdkantoor

Volg ons

Subscribe to our newsletter

FLANDERS HYDRAULICS

MARITIME TECHNOLOGY DIVISION