pith. machine review for the scientific record. sign in

arxiv: 1807.05118 · v1 · submitted 2018-07-13 · 💻 cs.LG · cs.DC· stat.ML

Recognition: unknown

Tune: A Research Platform for Distributed Model Selection and Training

Authors on Pith no claims yet
classification 💻 cs.LG cs.DCstat.ML
keywords algorithmssearchtunedistributedhyperparametermodelselectiontraining
0
0 comments X
read the original abstract

Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed for improving the efficiency of model selection, however their adaptation to the distributed compute environment is often ad-hoc. We propose Tune, a unified framework for model selection and training that provides a narrow-waist interface between training scripts and search algorithms. We show that this interface meets the requirements for a broad range of hyperparameter search algorithms, allows straightforward scaling of search to large clusters, and simplifies algorithm implementation. We demonstrate the implementation of several state-of-the-art hyperparameter search algorithms in Tune. Tune is available at http://ray.readthedocs.io/en/latest/tune.html.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CDS4RAG: Cyclic Dual-Sequential Hyperparameter Optimization for RAG

    cs.LG 2026-05 unverdicted novelty 7.0

    CDS4RAG cyclically optimizes full RAG hyperparameters by distinguishing and alternating between retriever and generator components, boosting performance up to 1.54x over prior methods on benchmarks.

  2. PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts

    cs.CL 2026-05 unverdicted novelty 6.0

    PEML co-optimizes continuous prompts and low-rank adaptations to deliver up to 6.67% average accuracy gains over existing multi-task PEFT methods on GLUE, SuperGLUE, and other benchmarks.

  3. Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring

    cs.AI 2026-04 unverdicted novelty 5.0

    Deep Researcher Agent is a framework for autonomous 24/7 deep learning experimentation by LLM agents using zero-cost monitoring, constant-size memory, and a minimal-toolset multi-agent design.

  4. Prediction of Magnetic Flux Evolution During Solar Active Region Emergence using Long Short-Term Memory Networks

    astro-ph.SR 2026-04 unverdicted novelty 5.0

    Standard LSTM networks predict solar active region magnetic flux evolution 3-10 hours ahead from intensity and oscillation maps, outperforming encoder-decoder variants on held-out test regions.

  5. Chrono::Ray: A Distributed Framework for High-Throughput Simulation-Based Analysis of Multibody Systems

    cs.CE 2026-05 accept novelty 4.0

    Chrono::Ray integrates Chrono and Ray into an open-source framework that enables scalable, user-friendly orchestration of large ensembles of multibody dynamics simulations.

  6. Optimization with SpotOptim

    cs.LG 2026-04 unverdicted novelty 3.0

    spotoptim is an open-source Python package that implements a Kriging-based optimization loop with Expected Improvement, mixed-variable support, noise handling via OCBA, parallelization, and restart mechanisms for blac...