Feature request
vLLM has introduced support for an external launcher, enabling vLLM processes to be co-located with other processes, such as training. By running multiple vLLM instances alongside the training process, we can speed up inference and reduce the time required for GRPO training. I propose adding an option in TRL to spawn one vLLM process per GPU using this external launcher.
Motivation
Efficient GRPO relies heavily on fast and scalable inference. Currently, inference and training processes run separately, introducing bottlenecks that slow down training. The ideal setup would run multiple vLLM instances inside the training process, as other frameworks such as OpenRLHF and VERL already do.
With vLLM's newly introduced external launcher (PR #12071), it is now possible to co-locate vLLM instances with training processes, allowing one vLLM instance to be spawned per GPU. This reduces inference latency, leading to shorter training runs.
By integrating vLLM’s external launcher into TRL, we can enhance distributed inference efficiency and accelerate GRPO training, making large-scale reinforcement learning more practical and scalable.
Your contribution
Modify GRPOTrainer to initialize vLLM via the external launcher when a TRL flag (such as self.args.external_launcher) is set. We are considering a Ray-less version, in which case the changes could be quite minimal.
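A minimal sketch of what the flag-gated initialization could look like. The flag name and the helper below are assumptions for illustration, not the actual TRL change; `distributed_executor_backend="external_launcher"` is the vLLM option introduced in PR #12071, and the memory fraction is just an example value. The script would run under `torchrun` with one process per GPU, so each rank builds its own engine:

```python
# Hypothetical sketch of flag-gated vLLM initialization for GRPOTrainer.
# Assumes launch via torchrun, one training process per GPU.

def make_vllm_kwargs(model_name: str, external_launcher: bool) -> dict:
    """Build constructor kwargs for vllm.LLM for co-located inference."""
    kwargs = {
        "model": model_name,
        # Leave most GPU memory for the training process (example value).
        "gpu_memory_utilization": 0.3,
    }
    if external_launcher:
        # The external launcher reuses the current process instead of
        # spawning workers, so each torchrun rank drives its own engine.
        kwargs["distributed_executor_backend"] = "external_launcher"
        kwargs["tensor_parallel_size"] = 1  # one vLLM instance per GPU
    return kwargs


# Inside GRPOTrainer.__init__ this could be wired roughly as (sketch):
# if self.args.external_launcher:  # hypothetical TRL flag
#     from vllm import LLM
#     self.llm = LLM(**make_vllm_kwargs(self.args.model_name, True))
```

With the flag off, the trainer would keep its current separate-inference behavior, so the change stays backward compatible.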