TY - GEN
T1 - Solving Safety Problems with Ensemble Reinforcement Learning
AU - Ferreira, Leonardo A.
AU - dos Santos, Thiago F.
AU - Bianchi, Reinaldo A.C.
AU - Santos, Paulo E.
PY - 2019
Y1 - 2019
N2 - An agent that learns by interacting with an environment may find unexpected solutions to decision-making problems. This solution can be an improvement over well-known ones, such as new strategies for games, but in some cases the unexpected solution is unwanted and should be avoided for reasons such as safety. This paper proposes a Reinforcement Learning Ensemble Framework called ReLeEF. This framework combines decision making methods to provide a finer grained control of the agent’s behaviour while still letting it learn by interacting with the environment. It has been tested in the safety gridworlds and the results show that it can find optimal solutions while fulfilling safety concerns described for each domain, something that state of the art Deep Reinforcement Learning methods were unable to do.
AB - An agent that learns by interacting with an environment may find unexpected solutions to decision-making problems. This solution can be an improvement over well-known ones, such as new strategies for games, but in some cases the unexpected solution is unwanted and should be avoided for reasons such as safety. This paper proposes a Reinforcement Learning Ensemble Framework called ReLeEF. This framework combines decision making methods to provide a finer grained control of the agent’s behaviour while still letting it learn by interacting with the environment. It has been tested in the safety gridworlds and the results show that it can find optimal solutions while fulfilling safety concerns described for each domain, something that state of the art Deep Reinforcement Learning methods were unable to do.
KW - Ontology
KW - Reinforcement Learning
KW - Safety
UR - http://www.scopus.com/inward/record.url?scp=85076546319&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-35288-2_17
DO - 10.1007/978-3-030-35288-2_17
M3 - Conference contribution
AN - SCOPUS:85076546319
SN - 9783030352875
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 203
EP - 214
BT - AI 2019
A2 - Liu, Jixue
A2 - Bailey, James
PB - Springer
CY - Cham, Switzerland
T2 - 32nd Australasian Joint Conference on Artificial Intelligence, AI 2019
Y2 - 2 December 2019 through 5 December 2019
ER -