Skip to main navigation Skip to search Skip to main content

PTDRLHF: Parameter Tuning Using Deep Reinforcement Learning with Human Feedback

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Many autonomous navigation algorithms require parameter re-tuning when facing new environments. This paper presents PTDRLHF, a parameter-tuning strategy that combines the Reinforcement Learning (RL)-based parameter tuning approach of Parameter Tuning using Deep Reinforcement Learning (PTDRL) [1] with human feedback to adaptively select from a predetermined set of parameters for a given navigation system in the context of social navigation. Our learning strategy is motivated by techniques for training language models using human feedback (HF) [2]. To the best of our knowledge, we are the first to implement an RLHF method for dynamic tuning in mobile navigation. In simulation, PTDRLHF preserves 20 % greater clearance from people and obstacles than PTDRL, with only a marginal decrease in average speed; in real-world trials, it outperforms the baseline on nearly all subjective evaluation measures.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 37th International Conference on Tools with Artificial Intelligence, ICTAI 2025
PublisherInstitute of Electrical and Electronics Engineers
Pages263-269
Number of pages7
ISBN (Electronic)9798331549190
DOIs
StatePublished - 1 Jan 2025
Event37th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2025 - Athens, Greece
Duration: 3 Nov 20255 Nov 2025

Publication series

NameProceedings - International Conference on Tools with Artificial Intelligence, ICTAI
ISSN (Print)1082-3409

Conference

Conference37th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2025
Country/TerritoryGreece
CityAthens
Period3/11/255/11/25

Keywords

  • Intelligent Robotics
  • Reinforcement Learning and Preference/Ranking

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'PTDRLHF: Parameter Tuning Using Deep Reinforcement Learning with Human Feedback'. Together they form a unique fingerprint.

Cite this