Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

Kfir Arviv, Helman Stern, Yael Edan

Research output: Contribution to journalArticlepeer-review

50 Scopus citations

Abstract

A two-robot flow-shop scheduling problem with n identical jobs and m machines is defined and evaluated for four robot collaboration levels corresponding to different levels of information sharing, learning and assessment: Full-robots work together, performing self and joint learning sharing full information; Pull-one robot decides when and if to learn from the other robot; Push-one robot may force the second to learn from it and None-each robot learns independently with no information sharing. Robots operate on parallel tracks, transporting jobs between successive machines, returning empty to a machine to move another job. The objective is to obtain a robot schedule that minimises makespan (Cmax) for machines with varying processing times. A new reinforcement learning algorithm is developed, using dual Q-learning functions. A novel feature in the collaborative algorithm is the assignment of different reward functions to robots; minimising robot idle time and minimising job waiting time. Such delays increase makespan. Simulation analyses with fast, medium and slow speed robots indicated that Full collaboration with a fast-fast robot pair was best according to minimum average upper bound error. The new collaborative algorithm provides a tool for finding optimal and near-optimal solutions to difficult collaborative multi-robot scheduling problems.

Original languageEnglish
Pages (from-to)1196-1209
Number of pages14
JournalInternational Journal of Production Research
Volume54
Issue number4
DOIs
StatePublished - 16 Feb 2016

Keywords

  • Job-transfer robots
  • collaboration
  • flow-shop
  • identical and non-identical robots
  • reinforcement learning
  • scheduling

ASJC Scopus subject areas

  • Strategy and Management
  • Management Science and Operations Research
  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem'. Together they form a unique fingerprint.

Cite this