Workstation capacity tuning using reinforcement learning

Aharon Bar-Hillel, Amir Di-Nur, Liat Ein-Dor, Ran Gilad-Bachrach, Yossi Ittach

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Computer grids are complex, heterogeneous, and dynamic systems, whose behavior is governed by hundreds of manuallytuned parameters. As the complexity of these systems grows, automating the procedure of parameter tuning becomes indispensable. fn this paper, we consider the problem of autotuning server capacity, i.e. the number of jobs a server runs in parallel. We present three different reinforcement learning algorithms, which generate a dynamic policy by changing the number of concurrent running jobs according to the job types and machine state. The algorithms outperform manually-tuned policies for the entire range of checked workloads, with average throughput improvement greater than 20%. On multi-core servers, the average throughput improvement is approximately 40%, which hints at the enormous improvement potential of such a tuning mechanism with the gradual transition to multi-core machines. (c) 2007 ACM.

Original languageEnglish
Title of host publicationProceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07
DOIs
StatePublished - 1 Dec 2007
Externally publishedYes
Event2007 ACM/IEEE Conference on Supercomputing, SC'07 - Reno, NV, United States
Duration: 10 Nov 200716 Nov 2007

Publication series

NameProceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07

Conference

Conference2007 ACM/IEEE Conference on Supercomputing, SC'07
Country/TerritoryUnited States
CityReno, NV
Period10/11/0716/11/07

Fingerprint

Dive into the research topics of 'Workstation capacity tuning using reinforcement learning'. Together they form a unique fingerprint.

Cite this