kipr | Scientific papers that were of interest for Prof. Juan-Antonio Fernández-Madrigal

Improvement on the classical regression-based estimation algorithm of the relative clock frequency of two remotely connected clocks for better behaviour under outliers, and a good related works section on the estimation of clock relative frequency

December 22, 2015 17:57 , Juan-Antonio Fernández-Madrigal

Oka Saputra, K.; Wei-Chung Teng; Tsung-Han Chen, Hough Transform-Based Clock Skew Measurement Over Network, in Instrumentation and Measurement, IEEE Transactions on , vol.64, no.12, pp.3209-3216, Dec. 2015, DOI: 10.1109/TIM.2015.2450293.

The accurate clock skew measurement of remote devices over network connections is crucial to device fingerprinting and other related applications. Current approaches use the lower bound of offsets between the target device and the measurer to estimate clock skew; however, the accuracy of estimation is severely affected when even a few offsets appear below the crowd of offsets. This paper adopted the Hough transform to develop a new method, which searches for the densest part of the whole distribution. This method is effective in filtering out the upper and lower outliers such that the skew values derived from the remaining offsets are stable, even when lower outliers occur, or when the measuring time is not long enough for current approaches to achieve stable results. The experimental evaluation of the proposed method has been conducted in order to compare its performance with that of linear programming algorithm (LPA) and two other approaches. During the five consecutive measurements of 1000 offsets each, skews of the proposed method varied within the range of 0.59 ppm, whereas LPA resulted in the range of 0.89 ppm. Both ranges increased to 1.34 and 63.93 ppm, respectively, when the lower bounds encountered interference from lower outliers.

Notes:

They assume there is no NTP running in the background; however, their results seem to come from a conventional TCP/IP network, where it is difficult not to find NTP enabled.

Posted in: Communication networks, Real-Time Systems , Tagged: Clock synchronization, Hough transform

Reinforcement learning in the automatic control area

December 22, 2015 17:41 , Juan-Antonio Fernández-Madrigal

Yu Jiang; Zhong-Ping Jiang, Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems, in Automatic Control, IEEE Transactions on , vol.60, no.11, pp.2917-2929, Nov. 2015, DOI: 10.1109/TAC.2015.2414811.

This paper presents a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB) equation to an optimization problem, which is solved via a new policy iteration method. The proposed method distinguishes from previously known nonlinear ADP methods in that the neural network approximation is avoided, giving rise to significant computational improvement. Instead of semiglobally or locally stabilizing, the resultant control policy is globally stabilizing for a general class of nonlinear polynomial systems. Furthermore, in the absence of the a priori knowledge of the system dynamics, an online learning method is devised to implement the proposed policy iteration technique by generalizing the current ADP theory. Finally, three numerical examples are provided to validate the effectiveness of the proposed method.

Posted in: Applications of reinforcement learning to control engineering , Tagged: Adaptive Dynamic Programming, Reinforcement learning

The quick-intuition vs. slow-deliberation dilemma from a decision-making perspective

December 22, 2015 17:36 , Juan-Antonio Fernández-Madrigal

Y-Lan Boureau, Peter Sokol-Hessner, Nathaniel D. Daw, Deciding How To Decide: Self-Control and Meta-Decision Making, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 700-710, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.013.

Many different situations related to self control involve competition between two routes to decisions: default and frugal versus more resource-intensive. Examples include habits versus deliberative decisions, fatigue versus cognitive effort, and Pavlovian versus instrumental decision making. We propose that these situations are linked by a strikingly similar core dilemma, pitting the opportunity costs of monopolizing shared resources such as executive functions for some time, against the possibility of obtaining a better outcome. We offer a unifying normative perspective on this underlying rational meta-optimization, review how this may tie together recent advances in many separate areas, and connect several independent models. Finally, we suggest that the crucial mechanisms and meta-decision variables may be shared across domains.

Posted in: Cognitive sciences , Tagged: Decision making, Intuition vs. deliberation

A possible framework for the relationship between culture, behavior and the brain

December 22, 2015 17:33 , Juan-Antonio Fernández-Madrigal

Shihui Han, Yina Ma, A Culture–Behavior–Brain Loop Model of Human Development, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 666-676, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.010.

Increasing evidence suggests that cultural influences on brain activity are associated with multiple cognitive and affective processes. These findings prompt an integrative framework to account for dynamic interactions between culture, behavior, and the brain. We put forward a culture–behavior–brain (CBB) loop model of human development that proposes that culture shapes the brain by contextualizing behavior, and the brain fits and modifies culture via behavioral influences. Genes provide a fundamental basis for, and interact with, the CBB loop at both individual and population levels. The CBB loop model advances our understanding of the dynamic relationships between culture, behavior, and the brain, which are crucial for human phylogeny and ontogeny. Future brain changes due to cultural influences are discussed based on the CBB loop model.

Posted in: Cognitive sciences , Tagged: Culture

On how moral can shape perception

December 22, 2015 17:30 , Juan-Antonio Fernández-Madrigal

Ana P. Gantman, Jay J. Van Bavel,Moral Perception, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 631-633, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.004.

Based on emerging research, we propose that human perception is preferentially attuned to moral content. We describe how moral concerns enhance detection of morally relevant stimuli, and both command and direct attention. These perceptual processes, in turn, have important consequences for moral judgment and behavior.

Posted in: Cognitive sciences , Tagged: Moral

Scheduling of communications between several nodes for better achieving real-time constraints in a distributed control system, and also a very detailed dynamical model of a wheeled vehicle

November 12, 2015 20:08 , Juan-Antonio Fernández-Madrigal

Naim Bajcinca, Wireless cars: A cyber-physical approach to vehicle dynamics control, Mechatronics, Volume 30, September 2015, Pages 261-274, ISSN 0957-4158, DOI: 10.1016/j.mechatronics.2015.04.016.

A non-conventional drive-by-wireless technology for guidance and control of a redundantly actuated electric car supported by an on-board wireless network of sensors, actuators and control units is proposed in this article. Several optimization-based distributed feedforward control schemes are developed for such powertrain infrastructures. In view of the limitations of the commercial off-the-shelf wireless communication technologies and the harshness of the in-vehicle environments, a pressing design and implementation aspect, in addition to the robustness against information loss, refers to fulfilling the hard real-time computational requirements. In this work, we address such problems by introducing several distributed event-based control schemes in conjunction with adaptive scheduling at the protocol level. Hereby we obtain a simple tuning mechanism to compromise between the outcome accuracy and computation efficiency (i.e., communication traffic intensity). Using simulative evaluations, we demonstrate the viability of the proposed algorithms and illustrate the impact of external interferences in an IEEE 802.15.4 based wireless communication solution.

Posted in: Communication networks, Task scheduling , Tagged: Communication regulation, Communication scheduling, Dynamical model

Study of how a complex motion planning problem solved through RRT can benefit from parallelization

November 12, 2015 19:59 , Juan-Antonio Fernández-Madrigal

Brian W. Satzinger, Chelsea Lau, Marten Byl, Katie Byl, Tractable locomotion planning for RoboSimian, The International Journal of Robotics Research November 2015 vol. 34 no. 13 1541-1558, DOI: 10.1177/0278364915584947.

This paper investigates practical solutions for low-bandwidth, teleoperated mobility for RoboSimian in complex environments. Locomotion planning for this robot is challenging due to kinematic redundancy. We present an end-to-end planning method that exploits a reduced-dimension rapidly-exploring random tree search, constraining a subset of limbs to an inverse kinematics table. Then, we evaluate the performance of this approach through simulations in randomized environments and in the style of the Defense Advanced Research Projects Agency Robotics Challenges terrain both in simulation and with hardware.
We also illustrate the importance of allowing for significant body motion during swing leg motions on extreme terrain and quantify the trade-offs between computation time and execution time, subject to velocity and acceleration limits of the joints. These results lead us to hypothesize that appropriate statistical “investment” of parallel computing resources between competing formulations or flavors of random planning algorithms can improve motion planning performance significantly. Motivated by the need to improve the speed of limbed mobility for the Defense Advanced Research Projects Agency Robotics Challenge, we introduce one formulation of this resource allocation problem as a toy example and discuss advantages and implications of such trajectory planning for tractable locomotion on complex terrain.

Posted in: Robot motion planning , Tagged: Parallelization, RRT

Using MDPs when the transition probability matrix is just partially specified, therefore getting closer to a model-free approach

November 12, 2015 19:42 , Juan-Antonio Fernández-Madrigal

Karina V. Delgado, Leliane N. de Barros, Daniel B. Dias, Scott Sanner, Real-time dynamic programming for Markov decision processes with imprecise probabilities, Artificial Intelligence, Volume 230, January 2016, Pages 192-223, ISSN 0004-3702, DOI: 10.1016/j.artint.2015.09.005.

Markov Decision Processes have become the standard model for probabilistic planning. However, when applied to many practical problems, the estimates of transition probabilities are inaccurate. This may be due to conflicting elicitations from experts or insufficient state transition information. The Markov Decision Process with Imprecise Transition Probabilities (MDP-IPs) was introduced to obtain a robust policy where there is uncertainty in the transition. Although it has been proposed a symbolic dynamic programming algorithm for MDP-IPs (called SPUDD-IP) that can solve problems up to 22 state variables, in practice, solving MDP-IP problems is time-consuming. In this paper we propose efficient algorithms for a more general class of MDP-IPs, called Stochastic Shortest Path MDP-IPs (SSP MDP-IPs) that use initial state information to solve complex problems by focusing on reachable states. The (L)RTDP-IP algorithm, a (Labeled) Real Time Dynamic Programming algorithm for SSP MDP-IPs, is proposed together with three different methods for sampling the next state. It is shown here that the convergence of (L)RTDP-IP can be obtained by using any of these three methods, although the Bellman backups for this class of problems prescribe a minimax optimization. As far as we are aware, this is the first asynchronous algorithm for SSP MDP-IPs given in terms of a general set of probability constraints that requires non-linear optimization over imprecise probabilities in the Bellman backup. Our results show up to three orders of magnitude speedup for (L)RTDP-IP when compared with the SPUDD-IP algorithm.

See also:

Karina Valdivia Delgado, Scott Sanner, Leliane Nunes de Barros, Efficient solutions to factored MDPs with imprecise transition probabilities, Artif. Intell. 175 (9–10) (2011) 1498–1527.
Satia, J. K., and Lave Jr., R. E. 1970. MDPs with uncertain transition probabilities. Operations Research 21:728–740
White III, C. C., and El-Deib, H. K. 1994. MDPs with Imprecise Transition Probabilities. Operations Research 42(4):739–749

Posted in: Artificial Intelligence , Tagged: MDPs, Task planning

Nice summary of reinforcement learning in control (Adaptive Dynamic Programming) and the use of Q-learning plus NN approximators for solving a control problem under a game theory framework

October 19, 2015 11:04 , Juan-Antonio Fernández-Madrigal

Kyriakos G. Vamvoudakis, Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems, Automatica, Volume 61, November 2015, Pages 274-281, ISSN 0005-1098, DOI: 10.1016/j.automatica.2015.08.017.

This work proposes a novel Q-learning algorithm to solve the problem of non-zero sum Nash games of linear time invariant systems with N -players (control inputs) and centralized uncertain/unknown dynamics. We first formulate the Q-function of each player as a parametrization of the state and all other the control inputs or players. An integral reinforcement learning approach is used to develop a model-free structure of N -actors/ N -critics to estimate the parameters of the N -coupled Q-functions online while also guaranteeing closed-loop stability and convergence of the control policies to a Nash equilibrium. A 4th order, simulation example with five players is presented to show the efficacy of the proposed approach.

Posted in: Applications of reinforcement learning to control engineering , Tagged: Adaptive Dynamic Programming, Game theory, Neural networks, Q-learning, Reinforcement learning

Electronic circuit for harvesting energy autonomously in a multi-sensor device

October 13, 2015 09:09 , Juan-Antonio Fernández-Madrigal

Dias, P.C.; Morais, F.J.O.; de Morais Franca, M.B.; Ferreira, E.C.; Cabot, A.; Siqueira Dias, J.A., Autonomous Multisensor System Powered by a Solar Thermoelectric Energy Harvester With Ultralow-Power Management Circuit, in Instrumentation and Measurement, IEEE Transactions on , vol.64, no.11, pp.2918-2925, Nov. 2015, DOI: 10.1109/TIM.2015.2444253.

An autonomous multisensor system powered by an energy harvester fabricated with a flat-panel solar thermoelectric generator with an ultralow-power management circuit is presented. The multisensor system was tested in an agricultural application, where every 15 min the values of the temperature, air humidity, and solar radiation have to be measured and stored in a mass memory device (a Secure Digital card), with their respective time stamp. The energy-harvesting switching dc-dc converter is based on a low-input-voltage commercial integrated circuit (LTC3108), which charges a 1.65-F supercapacitor up to 5.0 V. A novel ultralow-power management circuit was developed to replace the internal power management circuitry of the LTC3108, and using this circuit, the operation of the system when no energy can be harvested from the environment is extended from 136 h to more than 266 h. The solar thermoelectric generator used for the energy harvesting is composed of a bismuth telluride thermoelectric generator with a 110-mV/°C Seebeck coefficient sandwiched between a 40 cm \times 40 cm anodized aluminum flat panel and an aluminum heatsink. On a sunny winter day in the southern hemisphere (12 August 2014, at Campinas, SP—Brazil, Latitude: 22° 54’), the energy supplied by the harvesting system to the supercapacitor was 7 J.

Posted in: Electronics , Tagged: Autonomy, Energy harvesting

« Previous 1 … 69 70 71 72 73 … 80 Next »

Improvement on the classical regression-based estimation algorithm of the relative clock frequency of two remotely connected clocks for better behaviour under outliers, and a good related works section on the estimation of clock relative frequency

Oka Saputra, K.; Wei-Chung Teng; Tsung-Han Chen, Hough Transform-Based Clock Skew Measurement Over Network, in Instrumentation and Measurement, IEEE Transactions on , vol.64, no.12, pp.3209-3216, Dec. 2015, DOI: 10.1109/TIM.2015.2450293.

Reinforcement learning in the automatic control area

Yu Jiang; Zhong-Ping Jiang, Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems, in Automatic Control, IEEE Transactions on , vol.60, no.11, pp.2917-2929, Nov. 2015, DOI: 10.1109/TAC.2015.2414811.

The quick-intuition vs. slow-deliberation dilemma from a decision-making perspective

Y-Lan Boureau, Peter Sokol-Hessner, Nathaniel D. Daw, Deciding How To Decide: Self-Control and Meta-Decision Making, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 700-710, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.013.

A possible framework for the relationship between culture, behavior and the brain

Shihui Han, Yina Ma, A Culture–Behavior–Brain Loop Model of Human Development, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 666-676, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.010.

On how moral can shape perception

Ana P. Gantman, Jay J. Van Bavel,Moral Perception, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 631-633, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.004.

Scheduling of communications between several nodes for better achieving real-time constraints in a distributed control system, and also a very detailed dynamical model of a wheeled vehicle

Naim Bajcinca, Wireless cars: A cyber-physical approach to vehicle dynamics control, Mechatronics, Volume 30, September 2015, Pages 261-274, ISSN 0957-4158, DOI: 10.1016/j.mechatronics.2015.04.016.

Study of how a complex motion planning problem solved through RRT can benefit from parallelization

Brian W. Satzinger, Chelsea Lau, Marten Byl, Katie Byl, Tractable locomotion planning for RoboSimian, The International Journal of Robotics Research November 2015 vol. 34 no. 13 1541-1558, DOI: 10.1177/0278364915584947.

Using MDPs when the transition probability matrix is just partially specified, therefore getting closer to a model-free approach

Karina V. Delgado, Leliane N. de Barros, Daniel B. Dias, Scott Sanner, Real-time dynamic programming for Markov decision processes with imprecise probabilities, Artificial Intelligence, Volume 230, January 2016, Pages 192-223, ISSN 0004-3702, DOI: 10.1016/j.artint.2015.09.005.

Nice summary of reinforcement learning in control (Adaptive Dynamic Programming) and the use of Q-learning plus NN approximators for solving a control problem under a game theory framework

Kyriakos G. Vamvoudakis, Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems, Automatica, Volume 61, November 2015, Pages 274-281, ISSN 0005-1098, DOI: 10.1016/j.automatica.2015.08.017.

Electronic circuit for harvesting energy autonomously in a multi-sensor device

Post Navigation

Fields, areas and lines of research

Archives

Oka Saputra, K.; Wei-Chung Teng; Tsung-Han Chen, Hough Transform-Based Clock Skew Measurement Over Network, in Instrumentation and Measurement, IEEE Transactions on , vol.64, no.12, pp.3209-3216, Dec. 2015, DOI: 10.1109/TIM.2015.2450293.

Yu Jiang; Zhong-Ping Jiang, Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems, in Automatic Control, IEEE Transactions on , vol.60, no.11, pp.2917-2929, Nov. 2015, DOI: 10.1109/TAC.2015.2414811.

Y-Lan Boureau, Peter Sokol-Hessner, Nathaniel D. Daw, Deciding How To Decide: Self-Control and Meta-Decision Making, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 700-710, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.013.

Shihui Han, Yina Ma, A Culture–Behavior–Brain Loop Model of Human Development, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 666-676, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.010.

Ana P. Gantman, Jay J. Van Bavel,Moral Perception, Trends in Cognitive Sciences, Volume 19, Issue 11, November 2015, Pages 631-633, ISSN 1364-6613, DOI: 10.1016/j.tics.2015.08.004.

Naim Bajcinca, Wireless cars: A cyber-physical approach to vehicle dynamics control, Mechatronics, Volume 30, September 2015, Pages 261-274, ISSN 0957-4158, DOI: 10.1016/j.mechatronics.2015.04.016.

Brian W. Satzinger, Chelsea Lau, Marten Byl, Katie Byl, Tractable locomotion planning for RoboSimian, The International Journal of Robotics Research November 2015 vol. 34 no. 13 1541-1558, DOI: 10.1177/0278364915584947.

Karina V. Delgado, Leliane N. de Barros, Daniel B. Dias, Scott Sanner, Real-time dynamic programming for Markov decision processes with imprecise probabilities, Artificial Intelligence, Volume 230, January 2016, Pages 192-223, ISSN 0004-3702, DOI: 10.1016/j.artint.2015.09.005.

Kyriakos G. Vamvoudakis, Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems, Automatica, Volume 61, November 2015, Pages 274-281, ISSN 0005-1098, DOI: 10.1016/j.automatica.2015.08.017.

Post Navigation

Fields, areas and lines of research

Transversal topics, methods and tools

Archives