PTDRL: Parameter Tuning Using Deep Reinforcement Learning

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Scopus citations

    Abstract

    A variety of autonomous navigation algorithms exist that allow robots to move around in a safe and fast manner. Many of these algorithms require parameter re-tuning when facing new environments. In this paper, we propose PTDRL, a parameter-tuning strategy that adaptively selects from a fixed set of parameters those that maximize the expected reward for a given navigation system. Our learning strategy can be used for different environments, different platforms, and different user preferences. Specifically, we attend to the problem of social navigation in indoor spaces, using a classical motion planning algorithm as our navigation system and training its parameters to optimize its behavior. Experimental results show that PTDRL can outperform other online parameter-tuning strategies.

    Original languageEnglish
    Title of host publication2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023
    PublisherInstitute of Electrical and Electronics Engineers
    Pages11356-11362
    Number of pages7
    ISBN (Electronic)9781665491907
    DOIs
    StatePublished - 1 Jan 2023
    Event2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023 - Detroit, United States
    Duration: 1 Oct 20235 Oct 2023

    Publication series

    NameIEEE International Conference on Intelligent Robots and Systems
    ISSN (Print)2153-0858
    ISSN (Electronic)2153-0866

    Conference

    Conference2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023
    Country/TerritoryUnited States
    CityDetroit
    Period1/10/235/10/23

    ASJC Scopus subject areas

    • Control and Systems Engineering
    • Software
    • Computer Vision and Pattern Recognition
    • Computer Science Applications

    Fingerprint

    Dive into the research topics of 'PTDRL: Parameter Tuning Using Deep Reinforcement Learning'. Together they form a unique fingerprint.

    Cite this