Skip to content

slavashvets/setfit-example-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

setfit-example-classifier

Example SetFit classification of job-vacancy postings using a small synthetic dataset.

Project goal

Demonstrate how to fine-tune a Sentence-Transformers encoder with the SetFit framework to distinguish “interesting” tech jobs (senior, remote, well-paid) from everything else.

Quick start

Taskfile handles task management, and uv manages the virtual environment and dependencies automatically (including python executable).

# Train the model and save the best checkpoint under models/
task train

# Evaluate on the held-out test set
task test

# List all available tasks
task --list

Dataset

Sixty short, hand-crafted job-ad snippets labelled 1 = interesting (e.g. “Senior Rust Developer — Remote — $150k”) 0 = uninteresting (e.g. “Call-centre agent — rotating shifts”). See the three CSVs under data/.

Model

  • Base encoder: all-MiniLM-L6-v2
  • Contrastive fine-tuning + linear classification head via SetFit
  • CPU-only by default — change device in main.py if you have a GPU.

License

Released under the MIT License – see LICENSE for details.

About

Example SetFit classification of job-vacancy

Resources

License

Stars

Watchers

Forks

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy