Content-Length: 302353 | pFad | http://github.com/postgresml/postgresml/pull/1063

84 vllm support by kczimm · Pull Request #1063 · postgresml/postgresml · GitHub
Skip to content

vllm support #1063

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 9 commits into
base: master
Choose a base branch
from
Open

vllm support #1063

wants to merge 9 commits into from

Conversation

kczimm
Copy link
Contributor

@kczimm kczimm commented Oct 11, 2023

Enables support for vLLM. To use, you must specify the model field in the task parameter of the pgml.transform function and you must add "backend": "vllm" in the task parameters. For example,

SELECT * FROM pgml.transform(
    task => '{"model":"tiiuae/falcon-7b","backend":"vllm"}'::JSONB,
    inputs => Array['hello']
);

A list of supported models for vLLM can be found here.

Only one vLLM model can be loaded per client connection process due to a limitation in vLLM. The first call to pgml.transform with a given model will load the model ("cold start"), but subsequent calls will use the cached model. If you change the specified model in the same client connection, the cached model will be replaced with the new one.

@kczimm kczimm marked this pull request as ready for review October 19, 2023 20:46
@levkk
Copy link
Contributor

levkk commented Oct 19, 2023

Rebase on master to get #1102 which should fix the tests.

@kczimm kczimm force-pushed the kczimm-vllm-support branch from 5e20276 to aca505c Compare October 19, 2023 21:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/postgresml/postgresml/pull/1063

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy