Téléchargement | - Voir la version finale : Balancing information with observation costs in deep reinforcement learning (PDF, 2.2 Mio)
|
---|
Lien | https://caiac.pubpub.org/pub/0jmy7gpd/release/1 |
---|
Auteur | Rechercher : Bellinger, Colin1; Rechercher : Drozdyuk, Andriy1; Rechercher : Crowley, Mark; Rechercher : Tamblyn, Isaac2 |
---|
Affiliation | - Conseil national de recherches du Canada. Technologies numériques
- Conseil national de recherches du Canada. Technologies de sécurité et de rupture
|
---|
Format | Texte, Article |
---|
Conférence | The 35th Canadian Conference on Artificial Intelligence, May 30 - June 3, 2022, Toronto, ON., Virtual |
---|
Description physique | 12 p. |
---|
Sujet | deep reinforcement learning; partial observability; state measurement costs |
---|
Résumé | The use of reinforcement learning (RL) in scientific applications, such as materials design and automated chemistry, is increasing. A major challenge, however, lies in fact that measuring the state of the system is often costly and time consuming in scientific applications, whereas policy learning with RL requires a measurement after each time step. In this work, we make the measurement costs explicit in the form of a costed reward and propose the active-measure with costs framework that enables off-the-shelf deep RL algorithms to learn a policy for both selecting actions and determining whether or not to measure the state of the system at each time step. In this way, the agents learn to balance the need for information with the cost of information. Our results show that when trained under this regime, the Dueling DQN and PPO agents can learn optimal action policies whilst making up to 50\% fewer state measurements, and recurrent neural networks can produce a greater than 50\% reduction in measurements. We postulate the these reduction can help to lower the barrier to applying RL to real-world scientific applications. |
---|
Date de publication | 2022-05-27 |
---|
Maison d’édition | Canadian Artificial Intelligence Association |
---|
Licence | |
---|
Dans | |
---|
Langue | anglais |
---|
Publications évaluées par des pairs | Oui |
---|
Exporter la notice | Exporter en format RIS |
---|
Signaler une correction | Signaler une correction (s'ouvre dans un nouvel onglet) |
---|
Identificateur de l’enregistrement | 2e701a0c-744b-4c24-b5ce-b8191023ca33 |
---|
Enregistrement créé | 2022-06-22 |
---|
Enregistrement modifié | 2022-06-22 |
---|