{"id":1380,"date":"2023-07-17T09:10:36","date_gmt":"2023-07-17T08:10:36","guid":{"rendered":"https:\/\/babel.isa.uma.es\/kipr\/?p=1380"},"modified":"2023-07-17T10:05:03","modified_gmt":"2023-07-17T09:05:03","slug":"a-nice-survey-on-active-learning-in-particular-for-robotics","status":"publish","type":"post","link":"https:\/\/babel.isa.uma.es\/kipr\/?p=1380","title":{"rendered":"A nice survey on active learning, in particular for robotics"},"content":{"rendered":"<h4>Annalisa T. Taylor, Thomas A. Berrueta, Todd D. Murphey, <strong>Active learning in robotics: A review of control principles,<\/strong> . Mechatronics, Volume 77, 2021 <a href=\"https:\/\/doi.org\/10.1016\/j.mechatronics.2021.102576\" target=\"_blank\">DOI: 10.1016\/j.mechatronics.2021.102576<\/a>.<\/h4>\n<blockquote><p>Active learning is a decision-making process. In both abstract and physical settings, active learning demands<br \/>\nboth analysis and action. This is a review of active learning in robotics, focusing on methods amenable to<br \/>\nthe demands of embodied learning systems. Robots must be able to learn efficiently and flexibly through<br \/>\ncontinuous online deployment. This poses a distinct set of control-oriented challenges??one must choose<br \/>\nsuitable measures as objectives, synthesize real-time control, and produce analyses that guarantee performance<br \/>\nand safety with limited knowledge of the environment or robot itself. In this work, we survey the fundamental<br \/>\ncomponents of robotic active learning systems. We discuss classes of learning tasks that robots typically<br \/>\nencounter, measures with which they gauge the information content of observations, and algorithms for<br \/>\ngenerating action plans. Moreover, we provide a variety of examples ?? from environmental mapping to<br \/>\nnonparametric shape estimation ?? that highlight the qualitative differences between learning tasks, information<br \/>\nmeasures, and control techniques. We conclude with a discussion of control-oriented open challenges, including<br \/>\nsafety-constrained learning and distributed learning.<\/p><\/blockquote>\n<p><strong>NOTES:<\/strong><\/p>\n<ul>\n<li>RL can be considered one of the areas within computational learning theory, that usually ignore physical embodiment aspects of the learning agent. However, that is only so when RL explores through decision-making, not when it explores randomly, without much purpose of enhancing learning itself through its actions.<\/li>\n<li>RL caveats (particularly Deep RL): their large data requirements, lack of generalizability between tasks, as well as their inability to learn incrementally and guarantee<br \/>\nsafety.<\/li>\n<li>Bayesian filters can be seen as learner systems: they learn parameters of objects (pose) or environments (maps) aided by some models. However, they are more active learners when they use the robot actions to improve that parameter learning.<\/i>\n<li>Gaussian processes can be effective in learning those models when no parameterical form is available or much first-principle knowledge, for instance, when the robot has to learn the model only observing a small part of the environment (local).<\/li>\n<li>Entropy\/information, Fisher&#8217;s information (conditional information) and ergodicity are the main ways of measuring information gain in active learning.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Annalisa T. Taylor, Thomas A. Berrueta, Todd D. Murphey, Active learning in robotics: A review of control principles, . Mechatronics, <span class=\"ellipsis\">&hellip;<\/span> <span class=\"more-link-wrap\"><a href=\"https:\/\/babel.isa.uma.es\/kipr\/?p=1380\" class=\"more-link\"><span>Read More &rarr;<\/span><\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[35],"tags":[468,15],"class_list":["post-1380","post","type-post","status-publish","format-standard","hentry","category-psycho-physiological-bases-of-engineering","tag-active-learning","tag-reinforcement-learning"],"_links":{"self":[{"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/posts\/1380"}],"collection":[{"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1380"}],"version-history":[{"count":5,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/posts\/1380\/revisions"}],"predecessor-version":[{"id":1385,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=\/wp\/v2\/posts\/1380\/revisions\/1385"}],"wp:attachment":[{"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1380"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1380"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/babel.isa.uma.es\/kipr\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1380"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}