Robotics | kipr

Intrinsically motivated reinforcement learning for learning a model of the world

June 27, 2017 12:10 , Juan-Antonio Fernández-Madrigal

Todd Hester, Peter Stone, Intrinsically motivated model learning for developing curious robots, Artificial Intelligence, Volume 247, June 2017, Pages 170-186, ISSN 0004-3702, DOI: 10.1016/j.artint.2015.05.002.

Reinforcement Learning (RL) agents are typically deployed to learn a specific, concrete task based on a pre-defined reward function. However, in some cases an agent may be able to gain experience in the domain prior to being given a task. In such cases, intrinsic motivation can be used to enable the agent to learn a useful model of the environment that is likely to help it learn its eventual tasks more efficiently. This paradigm fits robots particularly well, as they need to learn about their own dynamics and affordances which can be applied to many different tasks. This article presents the texplore with Variance-And-Novelty-Intrinsic-Rewards algorithm (texplore-vanir), an intrinsically motivated model-based RL algorithm. The algorithm learns models of the transition dynamics of a domain using random forests. It calculates two different intrinsic motivations from this model: one to explore where the model is uncertain, and one to acquire novel experiences that the model has not yet been trained on. This article presents experiments demonstrating that the combination of these two intrinsic rewards enables the algorithm to learn an accurate model of a domain with no external rewards and that the learned model can be used afterward to perform tasks in the domain. While learning the model, the agent explores the domain in a developing and curious way, progressively learning more complex skills. In addition, the experiments show that combining the agent’s intrinsic rewards with external task rewards enables the agent to learn faster than using external rewards alone. We also present results demonstrating the applicability of this approach to learning on robots.
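
As a rough illustration of the two intrinsic rewards mentioned in the abstract, the sketch below computes a variance bonus from the disagreement among an ensemble's predictions and a novelty bonus from the distance to the closest previously visited state-action. The function names, the weights `v` and `n`, the scalar predictions and the L1 distance are assumptions made for this example; it is not the authors' implementation.

```python
from statistics import pvariance


def variance_bonus(predictions):
    """Disagreement among ensemble members' predictions for one state-action."""
    return pvariance(predictions)


def novelty_bonus(query, seen):
    """L1 distance from `query` to the closest previously visited state-action."""
    if not seen:
        return float("inf")
    return min(sum(abs(q - s) for q, s in zip(query, point)) for point in seen)


def intrinsic_reward(predictions, query, seen, v=1.0, n=1.0):
    """Weighted sum of both bonuses (the weights v and n are hypothetical)."""
    return v * variance_bonus(predictions) + n * novelty_bonus(query, seen)


# Two state-actions: one already visited with a confident model,
# one far from past experience with disagreeing ensemble predictions.
seen = [(0.0, 0.0), (1.0, 0.0)]
familiar = intrinsic_reward([0.5, 0.5, 0.5], (1.0, 0.0), seen)
unfamiliar = intrinsic_reward([0.1, 0.9, 0.5], (3.0, 2.0), seen)
```

Under these assumptions the unfamiliar state-action receives the larger reward, so a planner maximizing this signal would be drawn towards it.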

Posted in: Applications of reinforcement learning to robots, Artificial Intelligence , Tagged: Model-based reinforcement learning, Motivational robotics

State of the art and historical background of the classical divergence between AI and robotics

June 27, 2017 12:03 , Juan-Antonio Fernández-Madrigal

Kanna Rajan, Alessandro Saffiotti, Towards a science of integrated AI and Robotics, Artificial Intelligence, Volume 247, June 2017, Pages 1-9, ISSN 0004-3702, DOI: 10.1016/j.artint.2017.03.003.

The early promise of the impact of machine intelligence did not involve the partitioning of the nascent field of Artificial Intelligence. The founders of AI envisioned the notion of embedded intelligence as being conjoined between perception, reasoning and actuation. Yet over the years the fields of AI and Robotics drifted apart. Practitioners of AI focused on problems and algorithms abstracted from the real world. Roboticists, generally with a background in mechanical and electrical engineering, concentrated on sensori-motor functions. That divergence is slowly being bridged with the maturity of both fields and with the growing interest in autonomous systems. This special issue brings together the state of the art and practice of the emergent field of integrated AI and Robotics, and highlights the key areas along which this current evolution of machine intelligence is heading.

Posted in: Artificial Intelligence, Robotics , Tagged: AI vs Robotics

How “behaviour trees” generalize the subsumption architecture and some other control architecture frameworks

June 27, 2017 11:39 , Juan-Antonio Fernández-Madrigal

M. Colledanchise and P. Ögren, “How Behavior Trees Modularize Hybrid Control Systems and Generalize Sequential Behavior Compositions, the Subsumption Architecture, and Decision Trees,” in IEEE Transactions on Robotics, vol. 33, no. 2, pp. 372-389, April 2017. DOI: 10.1109/TRO.2016.2633567.

Behavior trees (BTs) are a way of organizing the switching structure of a hybrid dynamical system (HDS), which was originally introduced in the computer game programming community. In this paper, we analyze how the BT representation increases the modularity of an HDS and how key system properties are preserved over compositions of such systems, in terms of combining two BTs into a larger one. We also show how BTs can be seen as a generalization of sequential behavior compositions, the subsumption architecture, and decision trees. These three tools are powerful but quite different, and the fact that they are unified in a natural way in BTs might be a reason for their popularity in the gaming community. We conclude the paper by giving a set of examples illustrating how the proposed analysis tools can be applied to robot control BTs.
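
To make the composition idea concrete, here is a minimal sketch of a behavior tree with the two usual composite nodes, Sequence and Fallback, using the common memoryless tick semantics. The class names, the status strings and the toy obstacle-avoidance leaves are assumptions for illustration and are not taken from the paper. The Fallback root gives the left child priority over the right one, which loosely mirrors how a higher layer overrides a lower one in a subsumption architecture.

```python
SUCCESS, FAILURE, RUNNING = "success", "failure", "running"


class Action:
    """Leaf node wrapping a function that returns a status."""
    def __init__(self, name, fn):
        self.name, self.fn = name, fn

    def tick(self, state):
        return self.fn(state)


class Sequence:
    """Ticks children left to right; stops at the first that does not succeed."""
    def __init__(self, *children):
        self.children = children

    def tick(self, state):
        for child in self.children:
            status = child.tick(state)
            if status != SUCCESS:
                return status
        return SUCCESS


class Fallback:
    """Ticks children left to right; stops at the first that does not fail."""
    def __init__(self, *children):
        self.children = children

    def tick(self, state):
        for child in self.children:
            status = child.tick(state)
            if status != FAILURE:
                return status
        return FAILURE


def obstacle_near(state):
    return SUCCESS if state["obstacle_distance"] < 0.5 else FAILURE


def turn_away(state):
    state["command"] = "turn"
    return RUNNING


def drive_forward(state):
    state["command"] = "forward"
    return RUNNING


# Avoidance has priority; driving forward runs only when avoidance fails.
tree = Fallback(
    Sequence(Action("obstacle near?", obstacle_near), Action("turn away", turn_away)),
    Action("drive forward", drive_forward),
)

blocked = {"obstacle_distance": 0.3}
clear = {"obstacle_distance": 2.0}
tree.tick(blocked)
tree.tick(clear)
```

Because every node exposes the same `tick` interface, either subtree can be replaced by a larger tree without touching the rest, which is the modularity argument in a nutshell.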

Posted in: Robotic architectures , Tagged: Behaviour-based architectures, Subsumption architecture

On the current limitations of robotics research concerning the generalization of reported results to different set-ups

June 27, 2017 09:11 , Juan-Antonio Fernández-Madrigal

Francesco Amigoni, Matteo Luperto, Viola Schiaffonati, Toward generalization of experimental results for autonomous robots, Robotics and Autonomous Systems, Volume 90, April 2017, Pages 4-14, ISSN 0921-8890, DOI: 10.1016/j.robot.2016.08.016.

In this paper we discuss some issues in the experimental evaluation of intelligent autonomous systems, focusing on systems, like autonomous robots, operating in physical environments. We argue that one of the weaknesses of current experimental practices is the low degree of generalization of experimental results, meaning that knowing the performance a robot system obtains in a test setting does not provide much information about the performance the same system could achieve in other settings. We claim that one of the main obstacles to achieve generalization of experimental results in autonomous robotics is the low degree of representativeness of the selected experimental settings. We survey and discuss the degree of representativeness of experimental settings used in a significant sample of current research and we propose some strategies to overcome the emerging limitations.

Posted in: Robotics , Tagged: Research practices

Robots that pre-compute a number of possible behaviours (in simulation), then learn how well those behaviours perform (propagating the performance measures to similar behaviours through Gaussian Process Regression) and select the best one in each situation (through Bayesian Optimization), thus coping with varying environments and damage to the robot

June 27, 2017 09:02 , Juan-Antonio Fernández-Madrigal

A. Cully, et al. Robots that can adapt like animals, Nature, 521 (2015), pp. 503–507, DOI: 10.1038/nature14422.

Robots have transformed many industries, most notably manufacturing, and have the power to deliver tremendous benefits to society, such as in search and rescue, disaster response, health care and transportation. They are also invaluable tools for scientific exploration in environments inaccessible to humans, from distant planets to deep oceans. A major obstacle to their widespread adoption in more complex environments outside factories is their fragility. Whereas animals can quickly adapt to injuries, current robots cannot think outside the box to find a compensatory behaviour when they are damaged: they are limited to their pre-specified self-sensing abilities, can diagnose only anticipated failure modes, and require a pre-programmed contingency plan for every type of potential damage, an impracticality for complex robots. A promising approach to reducing robot fragility involves having robots learn appropriate behaviours in response to damage, but current techniques are slow even with small, constrained search spaces. Here we introduce an intelligent trial-and-error algorithm that allows robots to adapt to damage in less than two minutes in large search spaces without requiring self-diagnosis or pre-specified contingency plans. Before the robot is deployed, it uses a novel technique to create a detailed map of the space of high-performing behaviours. This map represents the robot’s prior knowledge about what behaviours it can perform and their value. When the robot is damaged, it uses this prior knowledge to guide a trial-and-error learning algorithm that conducts intelligent experiments to rapidly discover a behaviour that compensates for the damage. Experiments reveal successful adaptations for a legged robot injured in five different ways, including damaged, broken, and missing legs, and for a robotic arm with joints broken in 14 different ways. This new algorithm will enable more robust, effective, autonomous robots, and may shed light on the principles that animals use to adapt to injury.
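
The loop summarized in the post title (a pre-computed behaviour map, Gaussian Process regression to spread each trial's result to similar behaviours, and Bayesian optimization to pick the next trial) can be sketched as follows. This is a simplified toy with a one-dimensional behaviour descriptor. Using the map as the GP prior mean, the RBF kernel, the upper-confidence-bound acquisition and every parameter value are assumptions of this sketch rather than details confirmed by the text above.

```python
import math


def rbf(a, b, length=0.3):
    """Similarity between two behaviour descriptors."""
    return math.exp(-((a - b) ** 2) / (2 * length ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def gp_posterior(x, tested, residuals, noise=1e-3):
    """Posterior mean and variance of (real - simulated) performance at x."""
    if not tested:
        return 0.0, 1.0
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(tested)]
         for i, a in enumerate(tested)]
    k = [rbf(x, a) for a in tested]
    mean = sum(ki * wi for ki, wi in zip(k, solve(K, residuals)))
    var = 1.0 - sum(ki * wi for ki, wi in zip(k, solve(K, k)))
    return mean, max(var, 0.0)


def adapt(prior_map, trial, kappa=0.05, trials=5):
    """Run a few physical trials, each chosen by an upper confidence bound."""
    tested, residuals, best = [], [], None
    for _ in range(trials):
        def ucb(x):
            mean, var = gp_posterior(x, tested, residuals)
            return prior_map[x] + mean + kappa * math.sqrt(var)
        x = max(prior_map, key=ucb)
        y = trial(x)
        tested.append(x)
        residuals.append(y - prior_map[x])
        if best is None or y > best[1]:
            best = (x, y)
    return best


# Hypothetical map: simulated performance peaks at descriptor 0.8.
prior_map = {i / 10: 1.0 - abs(i / 10 - 0.8) for i in range(11)}


def damaged_robot(x):
    """Hypothetical real performance after damage: the peak moved to 0.3."""
    return 1.0 - abs(x - 0.3)


best_behaviour, best_performance = adapt(prior_map, damaged_robot)
```

Each trial lowers the expected performance of behaviours similar to one that disappointed, so the search moves away from the region invalidated by the damage instead of testing every behaviour.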

Posted in: Robotics , Tagged: Bayesian optimization, Behaviour-based architectures, Gaussian processes

Qualitative robot navigation

June 22, 2017 10:06 , Juan-Antonio Fernández-Madrigal

Sergio Miguel-Tomé, Navigation through unknown and dynamic open spaces using topological notions, Connection Science, DOI: 10.1080/09540091.2016.1277691.

Until now, most algorithms used for navigation have had the purpose of directing a system towards one point in space. However, humans communicate tasks by specifying spatial relations among elements or places. In addition, the environments in which humans develop their activities are extremely dynamic. The only option that allows for successful navigation in dynamic and unknown environments is making real-time decisions. Therefore, robots capable of collaborating closely with human beings must be able to make decisions based on the local information registered by the sensors and interpret and express spatial relations. Furthermore, when one person is asked to perform a task in an environment, this task is communicated given a category of goals so the person does not need to be supervised. Thus, two problems appear when one wants to create multifunctional robots: how to navigate in dynamic and unknown environments using spatial relations and how to accomplish this without supervision. In this article, a new architecture to address the two cited problems is presented, called the topological qualitative navigation architecture. In previous works, a qualitative heuristic called the heuristic of topological qualitative semantics (HTQS) has been developed to establish and identify spatial relations. However, that heuristic only allows for establishing one spatial relation with a specific object. In contrast, navigation requires a temporal sequence of goals with different objects. The new architecture attains continuous generation of goals and resolves them using HTQS. Thus, the new architecture achieves autonomous navigation in dynamic or unknown open environments.

Posted in: Robot motion planning , Tagged: Robot navigation

Spatio-temporal maps for mobile robots: taking time into account in the map

June 22, 2017 09:14 , Juan-Antonio Fernández-Madrigal

João Machado Santos, Tomáš Krajník, Tom Duckett, Spatio-temporal exploration strategies for long-term autonomy of mobile robots, Robotics and Autonomous Systems, Volume 88, February 2017, Pages 116-126, ISSN 0921-8890, DOI: 10.1016/j.robot.2016.11.016.

We present a study of spatio-temporal environment representations and exploration strategies for long-term deployment of mobile robots in real-world, dynamic environments. We propose a new concept for life-long mobile robot spatio-temporal exploration that aims at building, updating and maintaining the environment model during the long-term deployment. The addition of the temporal dimension to the explored space makes the exploration task a never-ending data-gathering process, which we address by application of information-theoretic exploration techniques to world representations that model the uncertainty of environment states as probabilistic functions of time. We evaluate the performance of different exploration strategies and temporal models on real-world data gathered over the course of several months. The combination of dynamic environment representations with information-gain exploration principles makes it possible to create and maintain up-to-date models of continuously changing environments, enabling efficient and self-improving long-term operation of mobile robots.
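
A small sketch may help to see how a state probability that depends on time can drive exploration. Below, each map cell stores the parameters of a periodic occupancy probability, and the robot visits the cell whose predicted state has the highest entropy at the current time. The single-cosine model, the parameter names and the two example cells are assumptions for illustration, not the temporal models evaluated in the paper.

```python
import math


def occupancy_probability(t, mean, amplitude, period, phase=0.0):
    """Probability that a cell is occupied at time t (hypothetical periodic model)."""
    p = mean + amplitude * math.cos(2 * math.pi * t / period + phase)
    return min(max(p, 0.0), 1.0)


def entropy(p):
    """Shannon entropy in bits of a binary state with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def most_informative(cells, t):
    """Cell whose state is most uncertain at time t, i.e. most worth observing."""
    return max(cells, key=lambda name: entropy(occupancy_probability(t, **cells[name])))


# Time is in hours; the door follows a daily cycle, the wall barely changes.
cells = {
    "door": {"mean": 0.5, "amplitude": 0.45, "period": 24.0},
    "wall": {"mean": 0.9, "amplitude": 0.0, "period": 24.0},
}
target_at_midnight = most_informative(cells, 0.0)
target_at_six = most_informative(cells, 6.0)
```

With these made-up parameters the door is almost certainly in a known state at hour 0, so the wall is the better target, while at hour 6 the door's state is a coin flip and observing it yields the most information.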

Posted in: Mobile robot mapping , Tagged: Mapping of dynamic environments, Spatio-temporal maps

Efficient detection of glass obstacles when using a laser rangefinder

June 22, 2017 09:07 , Juan-Antonio Fernández-Madrigal

Xun Wang, JianGuo Wang, Detecting glass in Simultaneous Localisation and Mapping, Robotics and Autonomous Systems, Volume 88, February 2017, Pages 97-103, ISSN 0921-8890, DOI: 10.1016/j.robot.2016.11.003.

Simultaneous Localisation and Mapping (SLAM) has become one of the key technologies used in advanced robot platforms. The current state-of-the-art indoor SLAM with laser scanning rangefinders can provide accurate realtime localisation and mapping services to mobile robotic platforms such as the PR2 robot. In recent years, many modern building designs feature large glass panels as one of the key interior fitting elements, e.g. large glass walls. Due to the transparent nature of glass panels, laser rangefinders are unable to produce accurate readings, which causes SLAM to function incorrectly in these environments. In this paper, we propose a simple and effective solution to identify glass panels based on the specular reflection of laser beams from the glass. Specifically, we use a simple technique to detect the reflected light intensity profile around the normal incident angle to the glass panel. Integrating this glass detection method with an existing SLAM algorithm, our SLAM system is able to detect and localise glass obstacles in realtime. Furthermore, the tests we conducted in two office buildings with a PR2 robot show the proposed method can detect ∼ 95% of all glass panels with no false positive detections. The source code of the modified SLAM with glass detection is released as an open source ROS package along with this paper.
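
One way to picture the intensity-profile idea is the sketch below, which scans the per-beam intensities of a single laser sweep for short, bright runs, on the assumption that a specular reflection from glass shows up only in the few beams close to normal incidence. The function name, the thresholds and the rule that wide bright runs are rejected are all hypothetical; the released ROS package may work quite differently.

```python
def find_specular_peaks(intensities, min_height, max_width):
    """Return beam indices of narrow intensity peaks in one laser scan.

    A peak is a contiguous run of beams with intensity >= min_height that is
    at most max_width beams wide; the index of its brightest beam is reported.
    """
    values = list(intensities)
    peaks, start = [], None
    # A sentinel below any threshold closes a run that reaches the last beam.
    for i, value in enumerate(values + [float("-inf")]):
        if value >= min_height and start is None:
            start = i
        elif value < min_height and start is not None:
            if i - start <= max_width:
                run = values[start:i]
                peaks.append(start + run.index(max(run)))
            start = None
    return peaks


# Made-up scan: a 3-beam spike (candidate glass) and a 6-beam bright surface.
scan = [10, 12, 11, 10, 80, 200, 90, 10, 11, 150, 150, 150, 150, 150, 150, 10]
candidates = find_specular_peaks(scan, min_height=70, max_width=3)
```

In a SLAM pipeline, the range reading of each candidate beam could then be inserted into the map as a glass obstacle, although how the detections are fused with the map is not described in the abstract.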

Posted in: Robot sensors , Tagged: Laser rangefinder

Improving sensory information, diagnosis and fault tolerance by using multiple sensors and sensor fusion, with a good related work section (2.3) on fault tolerance in data fusion

June 22, 2017 09:00 , Juan-Antonio Fernández-Madrigal

Kaci Bader, Benjamin Lussier, Walter Schön, A fault tolerant architecture for data fusion: A real application of Kalman filters for mobile robot localization, Robotics and Autonomous Systems, Volume 88, February 2017, Pages 11-23, ISSN 0921-8890, DOI: 10.1016/j.robot.2016.11.015.

Multisensor perception has an important role in robotics and autonomous systems, providing inputs for critical functions including obstacle detection and localization. It is starting to appear in critical applications such as drones and ADASs (Advanced Driver Assistance Systems). However, this kind of complex system is difficult to validate comprehensively. In this paper we look at multisensor perception systems in relation to an alternative dependability method, namely fault tolerance. We propose an approach for tolerating faults in multisensor data fusion that is based on the more traditional method of duplication–comparison, and that offers detection and recovery services. We detail an example implementation using Kalman filter data fusion for mobile robot localization. We demonstrate its effectiveness in this case study using real data and fault injection.
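
The duplication-comparison principle can be illustrated with two independent one-dimensional Kalman filters that estimate the same position from different (simulated) sensors; a fault is flagged when their estimates diverge beyond a threshold. The filter parameters, the threshold, the injected bias and the function names are assumptions for this sketch, and only the detection service is shown, not recovery.

```python
class Kalman1D:
    """Scalar Kalman filter: state x, variance p, process noise q, sensor noise r."""
    def __init__(self, x=0.0, p=1.0, q=0.01, r=0.1):
        self.x, self.p, self.q, self.r = x, p, q, r

    def step(self, u, z):
        # Predict using the commanded displacement u.
        self.x += u
        self.p += self.q
        # Correct using the position measurement z.
        k = self.p / (self.p + self.r)
        self.x += k * (z - self.x)
        self.p *= 1 - k
        return self.x


def first_detection(steps, fault_at=None, bias=5.0, threshold=0.5):
    """Return the first step at which the two branches disagree, or None."""
    branch_a, branch_b = Kalman1D(), Kalman1D()
    for t in range(1, steps + 1):
        truth = float(t)  # the robot advances one unit per step
        z_a = truth
        faulty = fault_at is not None and t >= fault_at
        z_b = truth + (bias if faulty else 0.0)  # injected sensor fault
        if abs(branch_a.step(1.0, z_a) - branch_b.step(1.0, z_b)) > threshold:
            return t
    return None


no_fault = first_detection(10)
with_fault = first_detection(10, fault_at=5)
```

Comparison alone shows that one branch is wrong but not which one, so identifying the faulty branch needs additional information beyond what this sketch provides.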

Posted in: Robot sensors , Tagged: Fault tolerance, Recursive bayesian estimation, Sensor diagnosis

A nice summary of motion planning

June 21, 2017 12:28 , Juan-Antonio Fernández-Madrigal

J. J. M. Lunenburg, S. A. M. Coenen, G. J. L. Naus, M. J. G. van de Molengraft and M. Steinbuch, “Motion Planning for Mobile Robots: A Method for the Selection of a Combination of Motion-Planning Algorithms,” in IEEE Robotics & Automation Magazine, vol. 23, no. 4, pp. 107-117, Dec. 2016. DOI: 10.1109/MRA.2015.2510798.

A motion planner for mobile robots is commonly built out of a number of algorithms that solve the two steps of motion planning: 1) representing the robot and its environment and 2) searching a path through the represented environment. However, the available literature on motion planning lacks a generic methodology to arrive at a combination of representations and search algorithm classes for a practical application. This article presents a method to select appropriate algorithm classes that solve both the steps of motion planning and to select a suitable approach to combine those algorithm classes. The method is verified by comparing its outcome with three different motion planners that have been successfully applied on robots in practice.
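
The two steps named in the abstract can be seen in a toy planner: the representation is an occupancy grid and the search is breadth-first over free cells. Both choices are assumptions made for this example, one possible combination among the many algorithm classes the article considers.

```python
from collections import deque


def plan(grid, start, goal):
    """Breadth-first search over the free cells (0) of an occupancy grid.

    Returns the list of (row, col) cells from start to goal, or None if the
    goal cannot be reached. Moves are 4-connected.
    """
    rows, cols = len(grid), len(grid[0])
    parents = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parents[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and (nr, nc) not in parents:
                parents[(nr, nc)] = cell
                queue.append((nr, nc))
    return None


# Step 1: represent the environment (1 = obstacle).
grid = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
# Step 2: search a path through that representation.
path = plan(grid, (0, 0), (2, 0))
```

Swapping the grid for a roadmap or the breadth-first search for A* changes one step without touching the other, which is why the two choices can be selected separately and then combined.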

Posted in: Robot motion planning , Tagged: Useful for teaching
