Model based path planning using Q-Learning

Avinash Sharma, Kanika Gupta, Anirudha Kumar, Aishwarya Sharma, Rajesh Kumar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Citations (Scopus)

Abstract

Though the classical robotics is highly proficient in accomplishing a lot of complex tasks, still it is far from exhibiting the human-like natural intelligence in terms of flexibility and reliability to work in dynamic scenarios. In order to render these qualities in the robots, reinforcement learning could prove to be quite effective. By employing learning based training provided by reinforcement learning methods, a robot can be made to learn to work in previously unforeseen situations. Still this learning task can be quite cumbersome due to its requirement of the huge amount of training data which makes the training quite inefficient in the real world scenarios. The paper proposes a model based path planning method using the e greedy based Q-learning. The scenario was modeled using a grid-world based simulator which is being used in the initial training of the agent. The trained policy is then improved to learn the real world dynamics by using the real world samples. This study proves the efficiency and reliability of the simulator-based training methodology.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Industrial Technology, ICIT 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages837-842
Number of pages6
ISBN (Electronic)9781509053209
DOIs
Publication statusPublished - 26 Apr 2017
Externally publishedYes
Event2017 IEEE International Conference on Industrial Technology, ICIT 2017 - Toronto, Canada
Duration: 23 Mar 201725 Mar 2017

Publication series

NameProceedings of the IEEE International Conference on Industrial Technology

Conference

Conference2017 IEEE International Conference on Industrial Technology, ICIT 2017
Country/TerritoryCanada
CityToronto
Period23/03/1725/03/17

Keywords

  • Grid-World
  • Model Based Control
  • Neural Network
  • Q-learning
  • Reinforcement Learning

ASJC Scopus subject areas

  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Model based path planning using Q-Learning'. Together they form a unique fingerprint.

Cite this