v2 huggingface support #546

Merged
montanalow merged 13 commits into master from montana/huggingface on Mar 15, 2023

Conversation

montanalow
Contributor

No description provided.

montanalow changed the title from "new task types" to "v2 huggingface support" on Feb 15, 2023
montanalow force-pushed the montana/huggingface branch from 15c1b3e to 995e0b2 on March 1, 2023
montanalow force-pushed the montana/huggingface branch from fe23f7d to 1574e72 on March 14, 2023
montanalow marked this pull request as ready for review on March 14, 2023
montanalow requested a review from santiatpml on March 14, 2023
@santiatpml
Contributor

  • In transformers.py, the model has to be resized to the length of the new tokenizer with model.resize_token_embeddings(len(tokenizer)) after line 298, model = AutoModelForCausalLM.from_pretrained(model_name) (a sketch follows this list).
  • We could reuse the transform function and pipeline for different tasks instead of generate, to keep the interface consistent between Hugging Face models and fine-tuned models.
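
A minimal sketch of the fix in the first bullet, assuming a causal-LM setup; the model name and the added token are placeholders, not values taken from transformers.py:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; transformers.py passes the project's model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.add_special_tokens({"pad_token": "<|pad|>"})  # e.g. a newly added token

model = AutoModelForCausalLM.from_pretrained(model_name)
# Without this call, the embedding matrix stays smaller than the extended
# tokenizer vocabulary, and any new token id indexes out of bounds.
model.resize_token_embeddings(len(tokenizer))
```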

@montanalow
Contributor (Author)

  • In transformers.py, the model has to be resized to the length of the new tokenizer with model.resize_token_embeddings(len(tokenizer)) after line 298, model = AutoModelForCausalLM.from_pretrained(model_name)

👍

  • We could reuse the transform function and pipeline for different tasks instead of generate, to keep the interface consistent between Hugging Face models and fine-tuned models.

I think there are a few reasons to create a new API:

  • Fine-tuned models need a new name. You may fine-tune gpt2 on several different tasks, so you need a way to specify which gpt2. We could upload these back to Hugging Face and use your_username/gpt2-special-tuning, but it seems just as natural to use our existing project_name, since you'll also likely want to test many different base models for fine-tuning in the same Project.
  • transform takes arbitrary JSON (possibly omitting a model name in favor of a task, which is similar to but not the same as a PostgresML deployment) and returns whatever arbitrary JSON happens to work with the Python call. generate is a more specific API that we know should take a string input and return a string output. There are further structured APIs I think we'll want for certain tasks, e.g. predict and predict_logits for text-classification, or functions to get embeddings out of models. The contrast is sketched after this list.
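
A rough sketch of the contrast between the two shapes, not the actual PostgresML code; the function signatures and the max_new_tokens value are illustrative assumptions:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

def transform(task: str, args: dict, inputs: list):
    # transform: arbitrary JSON in, arbitrary JSON out -- whatever the
    # underlying Hugging Face pipeline happens to accept and return.
    pipe = pipeline(task, **args)
    return pipe(inputs)

def generate(model_name: str, prompt: str) -> str:
    # generate: a narrower contract -- text in, text out.
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output_ids = model.generate(input_ids, max_new_tokens=50)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```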

I wouldn't be against extending the PostgresML transform function to accept a fine-tuned model name instead of a Hugging Face model name, but we'd need a way to disambiguate which model store (Hugging Face vs. fine-tuned PostgresML) to reference when a model name is passed.
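
Purely as a hypothetical sketch of that disambiguation (neither helper below exists in PostgresML), an explicit source flag is one option:

```python
from transformers import AutoModelForCausalLM

def load_fine_tuned_project(name: str):
    # Hypothetical placeholder for a lookup in PostgresML's own model
    # store, e.g. by project_name.
    raise NotImplementedError("PostgresML project lookup goes here")

def load_model(name: str, source: str = "huggingface"):
    # Disambiguate with an explicit flag rather than guessing from the name.
    if source == "pgml":
        return load_fine_tuned_project(name)
    return AutoModelForCausalLM.from_pretrained(name)  # Hugging Face hub
```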

montanalow merged commit c2cf3a2 into master on Mar 15, 2023
montanalow deleted the montana/huggingface branch on March 15, 2023
SilasMarvin pushed a commit that referenced this pull request on Oct 5, 2023