Add master/worker dynamic #11126

aliabid94 · 2025-05-02T22:39:22Z

Gradio apps can now have master/worker dynamics natively. A gradio app can be launched with the roles "master", "worker", or "hybrid" (default). A master app receives events and stores them in a queue and communicates updates to the client. A worker app communicates with the master to request tasks, process them, and reports results to the master. By default, gradio is launched in hybrid mode, so it does everything: receives tasks from the client, processes them, and then returns the result to client. A hybrid app can still have extra workers bind to it to help process tasks.

An example app:

import gradio as gr
import sys
import time

def greet(name, prog = gr.Progress()):
    print("worker here...")
    time.sleep(4)
    return "Hello " + name + "!"

with gr.Blocks() as demo:
    name = gr.Textbox(label="Name")
    output = gr.Textbox(label="Output Box")
    name.submit(fn=greet, inputs=name, outputs=output, api_name="greet")        

role = "master" if "-w" in sys.argv else "hybrid"

if __name__ == "__main__":
    demo.launch(role=role, master_url="http://localhost:7860/", app_key="test123")

The role kwarg can be "master", "worker", or "hybrid". Launch this as python run.py in one terminal to start the master, and python run.py -w to start the worker. You can attach multiple workers. You'll notice that without any workers, the master will not process any tasks, and just show the queue size. You can then add or remove workers to see how they handle tasks.

The master_url kwarg points a worker to the master to bind to it, and an app_key shared between masters and workers authenticates requests between them.

The purpose of this PR was to create a more mature autoscaling approach to gradio. Previously scaling was only possible via having nginx configs that required sticky sessions, so that the same IP address was served the same gradio server in a cluster, so that state was maintained. However, if a worker went offline, we would lose the tasks it was processing. Now, if a worker goes down, the master reassigns its tasks to a new worker.

With this approach, there's no need for an extra nginx config to do routing, and extra workers can be added without needing a separate IP address so that the nginx config can distinguish between them.

There are some parts of the app that are not stateless (gr.render, and any use of gr.state) and the data they hold is not JSONifyable, so in that case, gradio will internally use session_id to replicate "sticky sessions" - the same worker will be assigned all requests for a session_id. If a worker goes down, this data will be lost.

Ali Abid added 2 commits May 2, 2025 15:19

changes

57da056

changes

730f198

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add master/worker dynamic #11126

Add master/worker dynamic #11126

Uh oh!

aliabid94 commented May 2, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Add master/worker dynamic #11126

Are you sure you want to change the base?

Add master/worker dynamic #11126

Uh oh!

Conversation

aliabid94 commented May 2, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.