Juan-Antonio Fernández-Madrigal | kipr

A study of the influence of teleoperation in the remote driving of robots

March 12, 2018 09:45 , Juan-Antonio Fernández-Madrigal

Storms, J. & Tilbury, D., A New Difficulty Index for Teleoperated Robots Driving through Obstacles, J Intell Robot Syst (2018) 90: 147, DOI: 10.1007/s10846-017-0651-1.

Teleoperation allows humans to reach environments that would otherwise be too difficult or dangerous. The distance between the human operator and remote robot introduces a number of issues that can negatively impact system performance including degraded and delayed information exchange between the robot and human. Some operation scenarios and environments can tolerate these degraded conditions, while others cannot. However, little work has been done to investigate how factors such as communication delay, automation, and environment characteristics interact to affect teleoperation system performance. This paper presents results from a user study analyzing the effects of teleoperation factors including communication delay, autonomous assistance, and environment layout on user performance. A mobile robot driving task is considered in which subjects drive a robot to a goal location around obstacles as quickly (minimize time) and safely (avoid collisions) as possible. An environment difficulty index (ID) is defined in the paper and is shown to be able to predict the average time it takes for the human to drive the robot to a goal location with different obstacle configurations. The ID is also shown to predict the path chosen by the human better than travel time along that path.

Posted in: Human teleoperation

Multi-agent reinforcement learning for working with high-dimensional spaces

March 10, 2018 09:40 , Juan-Antonio Fernández-Madrigal

David L. Leottau, Javier Ruiz-del-Solar, Robert Babuška, Decentralized Reinforcement Learning of Robot Behaviors, Artificial Intelligence, Volume 256, 2018, Pages 130-159, DOI: 10.1016/j.artint.2017.12.001.

A multi-agent methodology is proposed for Decentralized Reinforcement Learning (DRL) of individual behaviors in problems where multi-dimensional action spaces are involved. When using this methodology, sub-tasks are learned in parallel by individual agents working toward a common goal. In addition to proposing this methodology, three specific multi agent DRL approaches are considered: DRL-Independent, DRL Cooperative-Adaptive (CA), and DRL-Lenient. These approaches are validated and analyzed with an extensive empirical study using four different problems: 3D Mountain Car, SCARA Real-Time Trajectory Generation, Ball-Dribbling in humanoid soccer robotics, and Ball-Pushing using differential drive robots. The experimental validation provides evidence that DRL implementations show better performances and faster learning times than their centralized counterparts, while using less computational resources. DRL-Lenient and DRL-CA algorithms achieve the best final performances for the four tested problems, outperforming their DRL-Independent counterparts. Furthermore, the benefits of the DRL-Lenient and DRL-CA are more noticeable when the problem complexity increases and the centralized scheme becomes intractable given the available computational resources and training time.
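The decentralized idea in the abstract, one learner per dimension of the action, all rewarded by a common goal, can be sketched as follows. This is only a minimal illustration in the spirit of independent learners, not the authors' DRL-Independent, DRL-CA or DRL-Lenient algorithms. The class name `IndependentQAgent`, the single-state toy task in `train_toy` and all parameter values are assumptions made for the example.

```python
import random
from collections import defaultdict


class IndependentQAgent:
    """Tabular Q-learner that controls one dimension of the joint action."""

    def __init__(self, n_actions, rng, alpha=0.1, gamma=0.95, epsilon=0.2):
        self.n_actions = n_actions
        self.rng = rng
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def greedy(self, state):
        values = self.q[state]
        return max(range(self.n_actions), key=values.__getitem__)

    def act(self, state):
        # Epsilon-greedy exploration over this agent's own action dimension.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        return self.greedy(state)

    def update(self, state, action, reward, next_state, done):
        # Standard Q-learning update; the other agents are treated as
        # part of the environment.
        target = reward if done else reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


def train_toy(steps=2000, seed=0):
    """Hypothetical one-state task: the shared reward is 1 only for joint action (1, 0)."""
    rng = random.Random(seed)
    agents = [IndependentQAgent(2, rng), IndependentQAgent(2, rng)]
    state = 0
    for _ in range(steps):
        joint = tuple(agent.act(state) for agent in agents)
        reward = 1.0 if joint == (1, 0) else 0.0
        for agent, action in zip(agents, joint):
            agent.update(state, action, reward, state, done=True)
    return agents
```

Each agent stores a table over its own two actions instead of one table over the four joint actions; that per-dimension factorization is what keeps the decentralized scheme small as the number of action dimensions grows.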

Posted in: Applications of reinforcement learning to robots, Reinforcement learning in AI , Tagged: Multiagent systems

Survey of the modelling of agents (intentions, goals, etc.)

March 8, 2018 09:27 , Juan-Antonio Fernández-Madrigal

Stefano V. Albrecht, Peter Stone, Autonomous agents modelling other agents: A comprehensive survey and open problems, Artificial Intelligence, Volume 258, 2018, Pages 66-95, DOI: 10.1016/j.artint.2018.01.002.

Much research in artificial intelligence is concerned with the development of autonomous agents that can interact effectively with other agents. An important aspect of such agents is the ability to reason about the behaviours of other agents, by constructing models which make predictions about various properties of interest (such as actions, goals, beliefs) of the modelled agents. A variety of modelling approaches now exist which vary widely in their methodology and underlying assumptions, catering to the needs of the different sub-communities within which they were developed and reflecting the different practical uses for which they are intended. The purpose of the present article is to provide a comprehensive survey of the salient modelling methods which can be found in the literature. The article concludes with a discussion of open problems which may form the basis for fruitful future research.

Posted in: Artificial Intelligence , Tagged: Multiagent systems, Survey

Improving the estimation of the offset parameter of heavy-tailed distributions through the injection of noise

March 8, 2018 09:17 , Juan-Antonio Fernández-Madrigal

Y. Pan, F. Duan, F. Chapeau-Blondeau and D. Abbott, Noise Enhancement in Robust Estimation of Location, IEEE Transactions on Signal Processing, vol. 66, no. 8, pp. 1953-1966, DOI: 10.1109/TSP.2018.2802463.

In this paper, we investigate the noise benefits to maximum likelihood type estimators (M-estimator) for the robust estimation of a location parameter. Two distinct noise benefits are shown to be accessible under these conditions. With symmetric heavy-tailed noise distributions, the asymptotic efficiency of the estimation can be enhanced by injecting extra noise into the M-estimators. With an asymmetric contaminated noise model having a convex cumulative distribution function, we demonstrate that addition of noise can reduce the maximum bias of the median estimator. These findings extend the analysis of stochastic resonance effects for noise-enhanced signal and information processing.
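For readers unfamiliar with M-estimators of location, the following is a minimal sketch of one common choice, the Huber estimator, computed by iteratively reweighted averaging. It shows only the baseline robust estimator, not the noise-injection scheme studied in the paper. The function name `huber_location`, the tuning constant `k=1.345` and the MAD-based scale are assumptions for the example.

```python
import statistics


def huber_location(samples, k=1.345, tol=1e-8, max_iter=100):
    """Huber M-estimate of location via iteratively reweighted averaging.

    The scale is fixed to the normalised median absolute deviation (MAD).
    """
    mu = statistics.median(samples)
    mad = statistics.median(abs(x - mu) for x in samples)
    scale = mad / 0.6745 if mad > 0 else 1.0
    for _ in range(max_iter):
        weights = []
        for x in samples:
            r = abs(x - mu) / scale
            # Full weight inside the threshold, down-weighted outside it.
            weights.append(1.0 if r <= k else k / r)
        new_mu = sum(w * x for w, x in zip(weights, samples)) / sum(weights)
        if abs(new_mu - mu) < tol:
            return new_mu
        mu = new_mu
    return mu
```

On data such as `[0.9, 1.0, 1.1, 1.05, 0.95, 50.0]` the sample mean is pulled far toward the outlier, while the Huber estimate stays close to the bulk of the samples.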

Posted in: Probability and statistics , Tagged: Heavy-tailed distributions, Probability distribution estimation

Using interactive reinforcement learning with the advisor being another reinforcement learning agent

March 2, 2018 09:07 , Juan-Antonio Fernández-Madrigal

Francisco Cruz, Sven Magg, Yukie Nagai & Stefan Wermter, Improving interactive reinforcement learning: What makes a good teacher?, Connection Science, DOI: 10.1080/09540091.2018.1443318.

Interactive reinforcement learning (IRL) has become an important apprenticeship approach to speed up convergence in classic reinforcement learning (RL) problems. In this regard, a variant of IRL is policy shaping which uses a parent-like trainer to propose the next action to be performed and by doing so reduces the search space by advice. On some occasions, the trainer may be another artificial agent which in turn was trained using RL methods to afterward becoming an advisor for other learner-agents. In this work, we analyse internal representations and characteristics of artificial agents to determine which agent may outperform others to become a better trainer-agent. Using a polymath agent, as compared to a specialist agent, an advisor leads to a larger reward and faster convergence of the reward signal and also to a more stable behaviour in terms of the state visit frequency of the learner-agents. Moreover, we analyse system interaction parameters in order to determine how influential they are in the apprenticeship process, where the consistency of feedback is much more relevant when dealing with different learner obedience parameters.
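The interaction described in the abstract, a trained agent proposing the next action to a learner, can be sketched as an action-selection rule. This is a hedged illustration and not the authors' implementation: the function `choose_action` and the parameters `likelihood` (how often advice is offered), `consistency` (how often the advice is the advisor's best action) and `obedience` (how often the learner follows it) are names assumed for the example, loosely following the terms used in the abstract.

```python
import random


def choose_action(learner_q, advisor_q, state, rng,
                  likelihood=0.3, consistency=0.9, obedience=1.0, epsilon=0.1):
    """Pick the learner's next action, optionally following advisor advice.

    learner_q and advisor_q map state -> {action: value}.
    """
    actions = list(learner_q[state])
    if rng.random() < likelihood:
        best = max(actions, key=lambda a: advisor_q[state][a])
        if rng.random() < consistency or len(actions) == 1:
            advice = best
        else:
            # Inconsistent feedback: the advisor suggests some other action.
            advice = rng.choice([a for a in actions if a != best])
        if rng.random() < obedience:
            return advice
    # No advice, or advice ignored: fall back to epsilon-greedy on own values.
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: learner_q[state][a])
```

Passing a seeded `random.Random` instance as `rng` keeps runs reproducible, which helps when comparing different settings of the interaction parameters.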

Posted in: Applications of reinforcement learning to robots, Reinforcement learning in AI , Tagged: Interactive reinforcement learning, Reinforcement learning

Using memory of past input data to improve the convergence of NN when faced with small samples

February 28, 2018 08:39 , Juan-Antonio Fernández-Madrigal

Zhang, S., Huang, K., Zhang, R. et al., Learning from Few Samples with Memory Network, Cogn Comput (2018) 10: 15, DOI: 10.1007/s12559-017-9507-z.

Neural networks (NN) have achieved great successes in pattern recognition and machine learning. However, the success of a NN usually relies on the provision of a sufficiently large number of data samples as training data. When fed with a limited data set, a NN’s performance may be degraded significantly. In this paper, a novel NN structure is proposed called a memory network. It is inspired by the cognitive mechanism of human beings, which can learn effectively, even from limited data. Taking advantage of the memory from previous samples, the new model achieves a remarkable improvement in performance when trained using limited data. The memory network is demonstrated here using the multi-layer perceptron (MLP) as a base model. However, it would be straightforward to extend the idea to other neural networks, e.g., convolutional neural networks (CNN). In this paper, the memory network structure is detailed, the training algorithm is presented, and a series of experiments are conducted to validate the proposed framework. Experimental results show that the proposed model outperforms traditional MLP-based models as well as other competitive algorithms in response to two real benchmark data sets.

Posted in: Artificial Intelligence , Tagged: Neural networks, Small samples

Using fuzzy Petri nets for mobile robot navigation

February 28, 2018 08:35 , Juan-Antonio Fernández-Madrigal

Seung-yun Kim, Yilin Yang, A self-navigating robot using Fuzzy Petri nets, Robotics and Autonomous Systems, Volume 101, 2018, Pages 153-165, DOI: 10.1016/j.robot.2017.11.008.

Petri nets (PNs) are capable of modeling nearly any conceivable system and can provide a better understanding of the idealized action sequence in which to most effectively describe or execute said system through their powerful analytical capabilities. However, because real world instances are rarely as consistent and ideal as simulated models, basic PN modeling and simulation properties may be insufficient in practical application. We remedy this through specialization in Fuzzy Petri nets (FPNs). Fuzzy logic is incorporated to better model a self-navigating robot algorithm, thanks to its versatile multi-valued logic reasoning. By using FPNs, it is possible to simulate, assess, and communicate the process and reasoning of the navigational algorithm and apply it to real world programming. In this paper, we propose a series of specific fuzzy algorithms intended to be implemented in concert on a mobile robot platform in order to optimize the sequence of actions needed for a given task, primarily the navigation of an unknown maze. A set of varied maze configurations were developed and simulated as PN and FPN models, providing a testing environment to examine the efficiency of several methodologies. Five methods, including an original proposal in this paper, were compared across 30,000 simulations, evaluating in particular performance in processing cost in time. Our experiments concluded with results suggesting a very competitive task completion time at a considerable fraction in processing cost compared to the closest performing alternatives.

Posted in: Robot motion planning , Tagged: Fuzzy logic, Petri nets

A model of others’ emotions that predicts experimental results very well

February 24, 2018 08:29 , Juan-Antonio Fernández-Madrigal

Rebecca Saxe, Seeing Other Minds in 3D, Trends in Cognitive Sciences, Volume 22, Issue 3, 2018, Pages 193-195, DOI: 10.1016/j.tics.2018.01.003.

Tamir and Thornton [1] have identified three key dimensions that organize our understanding of other minds. These dimensions (glossed as valence, social impact, and rationality) can capture the similarities and differences between concepts of internal experiences (anger, loneliness, gratitude), and also between concepts of personalities (aggressive, introverted, agreeable). Most impressively, the three dimensions explain the patterns of hemodynamic activity in our brains as we consider these experiences [2] (Box 1). States such as anger and gratitude are invisible, but the patterns evoked in our brain as we think about them are as predictable by the model of Tamir and Thornton as the patterns evoked in our visual cortex when we look at chairs, bicycles, or pineapples are predictable by models of high-level vision [3]. Human social prediction follows the same dimensions: observers predict that transitions are more likely between states that are ‘nearby’ in this abstract 3D space [4]. Thus, we expect that a friend now feeling ‘anxious’ will be more likely to feel ‘sluggish’ than ‘energetic’ later.

Posted in: Psycho-physiological bases of engineering , Tagged: Emotions, Mind models

Time synchronization (only offset) of power sinusoid signals

February 15, 2018 08:17 , Juan-Antonio Fernández-Madrigal

A. Mingotti, L. Peretto and R. Tinarelli, Accuracy Evaluation of an Equivalent Synchronization Method for Assessing the Time Reference in Power Networks, IEEE Transactions on Instrumentation and Measurement, vol. 67, no. 3, pp. 600-606, DOI: 10.1109/TIM.2017.2779328.

This paper deals with the evaluation of the accuracy performance of an approach for assessing the phase displacement between voltages at power network nodes. This task is accomplished by processing asynchronous measurements taken at each node. This turns into an equivalent synchronization, which is, therefore, obtained without exploiting any synchronization signals, such as the ones provided by means of wireless (i.e., global positioning system) or wired technologies. As a matter of fact, distribution system operators will gain the possibility of deploying, at more affordable costs, wide area measurement system (WAMS) over their power networks for enhancing their stability and reliability. Phasor measurement units (PMUs) are the most common examples of such WAMS, but, besides their high cost, there are circumstances where providing a time reference signal to remote PMUs often becomes a difficult task. This paper aims at recalling the basic theoretical principles of the method and at proving its applicability in power network through a deep analysis of its metrological performance.

Posted in: Electronics , Tagged: Clock synchronization

Hybridizing RRT with deliberative path planning to improve performance

February 13, 2018 07:54 , Juan-Antonio Fernández-Madrigal

Dong, Y., Camci, E. & Kayacan, E., Faster RRT-based Nonholonomic Path Planning in 2D Building Environments Using Skeleton-constrained Path Biasing, J Intell Robot Syst (2018) 89: 387, DOI: 10.1007/s10846-017-0567-9.

This paper presents a faster RRT-based path planning approach for regular 2-dimensional (2D) building environments. To minimize the planning time, we adopt the idea of biasing the RRT tree-growth in more focused ways. We propose to calculate the skeleton of the 2D environment first, then connect a geometrical path on the skeleton, and grow the RRT tree via the seeds generated locally along this path. We conduct batched simulations to find the universal parameters in manipulating the seeds generation. We show that the proposed skeleton-biased locally-seeded RRT (skilled-RRT) is faster than the other baseline planners (RRT, RRT*, A*-RRT, Theta*-RRT, and MARRT) through experimental tests using different vehicles in different 2D building environments. Given mild assumptions of the 2D environments, we prove that the proposed approach is probabilistically complete. We also present an application of the skilled-RRT for unmanned ground vehicle. Compared to the other baseline algorithms (Theta*-RRT and MARRT), we show the applicability and fast planning of the skilled-RRT in real environment.
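The idea of growing the tree from seeds generated locally along a precomputed path can be illustrated with a biased sampler. The sketch below is a rough approximation under stated assumptions and is not the skilled-RRT algorithm itself: the guide path is taken as given (skeleton extraction is not shown), segments are picked uniformly rather than by length, and the names `sample_near_path`, `biased_sample`, `bias` and `radius` are hypothetical.

```python
import random


def sample_near_path(path, rng, radius=0.5):
    """Return a point within `radius` (per axis) of a random spot on the polyline."""
    i = rng.randrange(len(path) - 1)
    (x0, y0), (x1, y1) = path[i], path[i + 1]
    t = rng.random()
    px, py = x0 + t * (x1 - x0), y0 + t * (y1 - y0)
    return (px + rng.uniform(-radius, radius), py + rng.uniform(-radius, radius))


def biased_sample(path, bounds, rng, bias=0.8, radius=0.5):
    """Sample near the guide path with probability `bias`, else uniformly in bounds."""
    if rng.random() < bias:
        return sample_near_path(path, rng, radius)
    (xmin, xmax), (ymin, ymax) = bounds
    return (rng.uniform(xmin, xmax), rng.uniform(ymin, ymax))
```

In an RRT loop, `biased_sample` would replace the usual uniform draw of the random configuration; keeping some uniform samples (`bias` below 1) leaves the planner a chance of exploring regions away from the guide path.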

Posted in: Robot motion planning , Tagged: Reactive / Deliberative, RRT
