Adds pipeline model caching in the transform function. #593


Merged: 2 commits into postgresml:master on Apr 17, 2023

Conversation

@f-prime (Contributor) commented on Apr 16, 2023

This PR adds the ability to run a query that looks like this:

SELECT pgml.transform(
        '{"model": "roberta-large-mnli"}'::JSONB, 
        inputs => ARRAY[
            'I love how amazingly simple ML has become!', 
            'I hate doing mundane and thankless tasks. ☹️'
        ],
        cache => TRUE
    ) AS positivity;

roberta-large-mnli will be cached in memory to prevent transformers.pipeline() from being called more than once for the same model.

By default, cache is FALSE.
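
For context, here is a minimal sketch of the behavior described above, assuming (as the hunk below suggests) a plpython-style transform() that receives task, args, and inputs as JSON strings; the function and cache names are illustrative, not the PR's exact code:

    import json
    import transformers

    # Illustrative module-level cache, keyed by model name.
    __cache_transformer_by_model = {}

    def transform(task, args, inputs, cache=False):
        task = json.loads(task)
        args = json.loads(args)
        inputs = json.loads(inputs)

        if cache:
            model = task.get("model")
            if model not in __cache_transformer_by_model:
                __cache_transformer_by_model[model] = transformers.pipeline(**task)
            pipe = __cache_transformer_by_model[model]
        else:
            pipe = transformers.pipeline(**task)

        return pipe(inputs, **args)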

The review discussion below is attached to this hunk in the transform function, where the pipeline is constructed:

    task = json.loads(task)
    args = json.loads(args)
    inputs = json.loads(inputs)

    pipe = transformers.pipeline(**task)
    model = task.get("model")

A contributor commented:

I think there may be different pipelines with the same model, to handle different tasks. We may need to use the full task param for caching. Something like:

    if cache:
        key = ','.join([str(key) + ':' + str(value) for (key, value) in sorted(task.items())])
        if key not in __cache_transformer_by_task:
            __cache_transformer_by_task[key] = transformers.pipeline(**task)
        pipe = __cache_transformer_by_task[key]
    else:
        pipe = transformers.pipeline(**task)
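
As a self-contained variant of this suggestion (the cache dict is spelled out, and json.dumps with sort_keys=True is an assumed alternative way to build a deterministic key; neither detail is taken from the review itself):

    import json
    import transformers

    # Module-level cache, keyed by the full task dict rather than just the model.
    __cache_transformer_by_task = {}

    def get_pipeline(task, cache=False):
        if not cache:
            return transformers.pipeline(**task)
        # Serializing the sorted task dict gives a deterministic key and,
        # unlike str(value), distinguishes nested values and value types.
        key = json.dumps(task, sort_keys=True)
        if key not in __cache_transformer_by_task:
            __cache_transformer_by_task[key] = transformers.pipeline(**task)
        return __cache_transformer_by_task[key]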

@f-prime (Author) replied:

I see, good point. I'll think some more about the parameters of .pipeline() and push an update.

@f-prime (Author) commented on Apr 17, 2023:

Okay, I pretty much copied your code verbatim. I also like the idea there that you have to pass the cache parameter to actually USE the cached model. My initial version always used the cached pipeline when it was available, whether or not the cache flag was passed in.
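
A hypothetical side-by-side of the two policies being discussed, with illustrative names:

    import transformers

    _cache = {}

    # Merged behavior: the cache flag gates both storing and reading.
    def get_pipe_opt_in(task, key, cache):
        if cache:
            if key not in _cache:
                _cache[key] = transformers.pipeline(**task)
            return _cache[key]
        return transformers.pipeline(**task)

    # Initial behavior: a cached pipeline was reused whenever present,
    # regardless of the cache flag.
    def get_pipe_always_read(task, key, cache):
        if key in _cache:
            return _cache[key]
        pipe = transformers.pipeline(**task)
        if cache:
            _cache[key] = pipe
        return pipe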

@montanalow (Contributor) commented:

This looks good to me as an improvement over the status quo, so I'm going to merge it. We're being slightly inconsistent: every other API caches by default, including pgml.embed(), which may use similarly large models. I'm not really content with that approach either, since the only way to clear those caches is to drop the connection and kill the backend process. It's good to have these examples, though; we'll want to design a consistent interface around them for more feature-rich cache management.
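
For readers unfamiliar with the workaround described above: because the cache lives in the backend process's memory, clearing it today means ending that session, e.g. with the standard Postgres facility below (the pid is a placeholder you would look up in pg_stat_activity):

    -- Terminate a specific backend, and with it any in-process model cache.
    SELECT pg_terminate_backend(12345);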

@montanalow merged commit 41de0aa into postgresml:master on Apr 17, 2023