Article | REF: S7793 V1

Cooperation of multiple reinforcement learning algorithms

Authors: Benoît GIRARD, Mehdi KHAMASSI

Publication date: December 10, 2016 | Lire en français

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!

Automatically translated using artificial intelligence technology (Note that only the original version is binding) > find out more.

A | A

2. Methods for coordinating learning algorithms

Overall, three main families of combinations of multiple learning algorithms have been proposed: merging the outputs of these algorithms before making a decision; selecting one of these algorithms, which then takes sole control of the agent; this selection may result from monitoring the evolution of internal variables, or from a second learning layer.

2.1 Static fusion

If we're not looking to optimize the use of computing resources, but only to improve an agent's behavior, we can systematically calculate the outputs of all the learning systems, and then merge them before making a decision. The idea is then that actions that meet with consensus are probably the best.

The first method of coordinating learning algorithms

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!

The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors

+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year

From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Robotics

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Methods for coordinating learning algorithms

Previous
page Reinforcement learning

Conclusion

Bibliography

(1) - BALLEINE (B.W.), O'DOHERTY (J.P.) - Human and rodent homologies in action control : corticostriatal determinants of goal-directed and habitual action. - Neuropsychopharmacology, 35(1), 48-69, (2010).
(2) - BELLMAN (R.E.) - Dynamic Programming....

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!

The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors

+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year

From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

Outline
Full outline