Model Fine Tuning Documentation
Introduction
Model Orchestration (MO for short) is the component where we fine-tune the model for the specific
activities of the knowledge work.
The knowledge work contains a list of activities, and for each activity there is a recommended
model. We can fine-tune the recommended model, or we can choose another model and fine-tune that instead.
Process
1. Model Selection
2. Upload Dataset
3. Hyperparameters
4. Inference
Explanation
Model Selection:
For each task of the knowledge work, we choose a model based on the activity. In the knowledge
work, the base model specifies what type of task it is: predictive, recognition, or generative.
Thus, for each task there is a model to select. A recommended model is already chosen at the
knowledge work step, but we can also choose a model on our own.
Concepts in Model Selection:
Here we have 3 types of models based on the work: Predictive, Recognition, and Generative.
Predictive model:
Involves using statistical algorithms and machine learning techniques to analyze historical data and
make predictions about the future or unknown events.
Once the model is trained, it can be used to make predictions on new data where the target variable is
unknown.
Recognition model:
This is a model which focuses on identifying the patterns, features or attributes within a given
dataset
• Image Recognition: Identifying objects, people, or scenes in images. For example, convolutional
neural networks (CNNs) are often used for tasks like facial recognition or identifying specific objects
in photos.
• Speech Recognition: Converting spoken language into text. This involves processing audio signals
and using models to recognize words and phrases.
Generative model:
Aims to learn the underlying patterns or attributes of data in order to generate new, similar data.
• Unlike discriminative models, which focus on distinguishing between different classes or
categories, generative models learn the underlying distribution of the data and can create new
instances from that distribution.
• Image Generation: Creating new images that resemble a training set, such as generating realistic
photographs. Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) are
popular techniques used for this purpose.
• Text Generation: Producing coherent and contextually relevant text, such as writing stories, articles,
or dialogue. Models like GPT (Generative Pre-trained Transformer) are examples of generative
models for text.
Upload Dataset:
Here, we have to upload the dataset for training the model, or more specifically for fine-tuning it with
task-specific data.
The basic step is to upload an SFT (Supervised Fine-Tuning) dataset for fine-tuning the model. After we
have trained it with the SFT method, we can fine-tune again using different techniques such as
DPO, KTO, or RM+PPO.
What are SFT and the other fine-tuning techniques such as DPO, KTO, and RM+PPO?
SFT
SFT (Supervised Fine-Tuning) is a fine-tuning technique where we take an already trained model,
such as GPT or Llama, and fine-tune it on our specific task so that the model aligns to that task and is
better at answering prompts related to it. SFT uses a paired dataset where each input is mapped to a
specific output: instead of a plain CSV of raw data, we provide curated input/output pairs to
fine-tune the model, so that it gives better responses and aligns more closely with our required
preferences.
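For illustration, here is a minimal Python sketch of what such a paired SFT dataset can look like and how it might be flattened into training text. The example pairs and the instruction/response template are hypothetical, not a fixed format:

# A few hand-curated input/output pairs (hypothetical examples).
sft_examples = [
    {"prompt": "Summarize: The invoice is overdue by 30 days.",
     "response": "The invoice is 30 days overdue."},
    {"prompt": "Classify the sentiment: I love this product.",
     "response": "Positive"},
]

def format_example(example):
    # Flatten one prompt/response pair into a single training string.
    return f"### Instruction:\n{example['prompt']}\n### Response:\n{example['response']}"

for ex in sft_examples:
    print(format_example(ex))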
DPO
First Understanding:
Direct Preference Optimization.
A collection of triplets that map a specific input to a desired and an undesired output.
It fine-tunes the model to generate responses that are more aligned with human preferences.
Format of the dataset:
Prompt: ""
Preferred response: ""
Unpreferred response: ""
Second Search.
Optimizes the model based on human preferences using direct feedback. Human preference
here means which output is better or more aligned with the user's goal.
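As a minimal sketch (assuming PyTorch, with made-up log-probabilities), the DPO loss for one triplet can be computed roughly like this; the policy is pushed to prefer the chosen response more strongly than a frozen reference model does:

import torch
import torch.nn.functional as F

beta = 0.1  # strength of the preference term (a commonly used default)

# Dummy sequence log-probabilities for one (prompt, preferred, unpreferred) triplet.
logp_chosen = torch.tensor(-12.0)      # policy log p(preferred | prompt)
logp_rejected = torch.tensor(-15.0)    # policy log p(unpreferred | prompt)
ref_logp_chosen = torch.tensor(-13.0)  # frozen reference model, same inputs
ref_logp_rejected = torch.tensor(-14.0)

margin = (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
loss = -F.logsigmoid(beta * margin)  # DPO loss for this triplet
print(loss.item())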
KTO
First understanding
Another type of preference dataset that can be used to train models to make decisions. The
model relies on simple binary preferences.
Second Search
Kahneman-Tversky Optimization
Aligns the model with human feedback. The method is inspired by principles of prospect
theory developed by Daniel Kahneman and Amos Tversky.
Dataset format:
Input: ""
Output: ""
Label: 1 or 0
Training process:
KTO uses a loss function that focuses on maximizing the likelihood of outputs with high
utility (label 1) while penalizing outputs with low utility (label 0).
It incorporates cognitive biases from prospect theory: it penalizes undesired outputs more
than it rewards desired outputs (loss aversion).
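A minimal Python sketch of this dataset format (the rows and field names are hypothetical; the exact names depend on the training library):

# Each example is a single (input, output) pair with a binary desirability label,
# rather than a chosen/rejected pair as in DPO.
kto_examples = [
    {"input": "Explain what a reward model is.",
     "output": "A reward model scores responses by how well they match human preferences.",
     "label": 1},   # desirable output
    {"input": "Explain what a reward model is.",
     "output": "It is a database index.",
     "label": 0},   # undesirable output
]

desirable = [ex for ex in kto_examples if ex["label"] == 1]
undesirable = [ex for ex in kto_examples if ex["label"] == 0]
print(len(desirable), "desirable,", len(undesirable), "undesirable")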
PPO
Proximal Policy Optimization is a reinforcement learning algorithm designed to optimize the
policy (the agent's strategy for choosing an action based on its current state) while ensuring
stable learning and efficient training.
PPO is part of the family of policy gradient methods, where the goal is to directly optimize the
policy rather than the value function.
What is a value function?
It estimates how good it is for the agent to be in a particular state (or to take an action in a
particular state) by predicting the expected cumulative future reward.
PPO by itself is an RL algorithm, so to fine-tune a model it needs a reward signal. To obtain the
rewards, we use Reward Modeling (RM) along with it.
Why with RM?
The flow is:
- First, SFT sets the base of the fine-tuning.
- Then reward modeling adds human-preference alignment.
- Then PPO fine-tunes the model, applying the reward model as the reward signal for the RL algorithm.
- PPO uses feedback from the reward model to optimize the policy.
In short, PPO is a reinforcement learning algorithm which requires a reward model to fine-tune
the model.
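The "stable learning" part of PPO comes from its clipped objective. Here is a minimal sketch on dummy values (assuming PyTorch; in RM+PPO fine-tuning the advantages would be derived from reward-model scores):

import torch

eps = 0.2  # clipping range

# Dummy log-probabilities of the sampled tokens under the new and old policy, plus advantages.
logp_new = torch.tensor([-1.0, -0.5, -2.0])
logp_old = torch.tensor([-1.2, -0.7, -1.5])
advantages = torch.tensor([0.8, 1.2, -0.5])

ratio = torch.exp(logp_new - logp_old)                       # probability ratio new/old
unclipped = ratio * advantages
clipped = torch.clamp(ratio, 1 - eps, 1 + eps) * advantages

# PPO maximizes the minimum of the two terms, so large policy jumps are not rewarded.
loss = -torch.min(unclipped, clipped).mean()
print(loss.item())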
Hyperparameters Selection
Concepts in Hyperparameters
Introduction
Hyperparameters are configuration variables that are set manually before training a model.
They are adjustable parameters that control the training process of a machine learning
model.
Why is it important?
Hyperparameters are essential when fine-tuning a model. They significantly influence the
performance, efficiency, and effectiveness of the fine-tuning process. They define key settings
such as aspects of the model architecture, the learning rate, and the model complexity. The
model's performance depends heavily on them.
If you fine-tune a model without specifying hyperparameters, the process will rely on default
settings provided by the framework or library you're using.
Most libraries have default values for hyperparameters like learning rate, batch size, number
of epochs, optimizer type, etc.
These defaults are general-purpose and not optimized for your specific dataset, model, or
task. While this can work in some cases, it often leads to suboptimal results or wasted
computational resources.
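For example, with the Hugging Face transformers library the common hyperparameters can be set explicitly instead of relying on defaults. The values below are purely illustrative, not recommendations:

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./fine-tuned-model",   # where checkpoints are written
    learning_rate=2e-5,                # optimizer step size
    per_device_train_batch_size=8,     # batch size per device
    num_train_epochs=3,                # passes over the training data
    weight_decay=0.01,                 # regularization strength
    warmup_ratio=0.1,                  # fraction of steps used for learning-rate warmup
)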
Hyperparameter Tuning
It is the process of finding the configuration of hyperparameters that results in the best
performance.
Usually the libraries give you the list of hyperparameters to use when fine-tuning with a
specific technique, but there are also techniques through which good hyperparameter values
can be found.
Techniques:
1. GridSearchCV:
A brute-force approach to finding suitable hyperparameters. It fits the model using every possible
combination from a parameter grid and checks which combination gives the best results, as in the sketch below.
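A minimal GridSearchCV sketch with scikit-learn; the toy dataset, estimator, and parameter grid are chosen only for illustration:

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = load_iris(return_X_y=True)

# Every combination of these values is tried with cross-validation.
param_grid = {
    "n_estimators": [50, 100],
    "max_depth": [3, 5, None],
}

search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid, cv=3)
search.fit(X, y)
print(search.best_params_, search.best_score_)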
Inference
Introduction
Inference is the process of running live data through a trained model to make a prediction or
solve a task. Up until now, the model was in the training/fine-tuning stage, where we were still
configuring it for our specific task. Inference is when we test the model's ability to generate
output on real, unseen data and obtain real-time outputs.
vLLM
vLLM is a library used for LLM inference and serving.
It is built to streamline the process of deploying and serving large models for inference,
focusing on optimizing performance in terms of speed and resource utilization. It is designed to
provide fast and efficient inference even for very large models.
Serving large models also consumes a massive amount of memory, but vLLM optimizes
memory usage so that models can be served efficiently on the available hardware.
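A minimal vLLM usage sketch; the model name is only an example and would normally be replaced by the fine-tuned checkpoint:

from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # example model; point this at your fine-tuned model

sampling_params = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=128)
outputs = llm.generate(["Summarize the benefits of fine-tuning."], sampling_params)

for output in outputs:
    print(output.outputs[0].text)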
Evaluation Metrics
Evaluation metrics are quantitative measures used to assess the performance of a machine
learning model. They help in determining how well a model is performing on a given task,
whether it's classification, regression, or another type of problem.
The metrics used in this platform are: Perplexity, ROUGE, and BLEU.
Perplexity
Perplexity is a metric used to measure how well a model predicts a sample of text. In simpler
terms, perplexity can be thought of as a measure of how "confused" or uncertain the model
is when predicting the next word or sequence of words in a sentence.
A low perplexity value indicates that the model is good at predicting the next word in a
sequence, meaning the model is confident and accurate in its predictions.
A high perplexity value suggests that the model is uncertain or less accurate in its
predictions.
Perplexity is calculated by measuring the model’s likelihood of predicting the actual
sequence of words in the test data.
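Written out, the standard formulation for a test sequence of N tokens w_1, ..., w_N is

\mathrm{Perplexity}(W) = \exp\left( -\frac{1}{N} \sum_{i=1}^{N} \log p(w_i \mid w_{<i}) \right)

so a lower average negative log-likelihood on the test data means a lower perplexity.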
BLEU:
Bilingual Evaluation Understudy. Compares n-grams (sequences of words) in the generated text
with a reference text.
It captures surface-level similarity between the model output and the reference output; it does
not capture semantic meaning, only word-to-word overlap.
Better suited for translation tasks.
ROUGE:
Recall-Oriented Understudy for Gisting Evaluation.
Measures n-gram overlap, particularly recall, between the generated and reference text.
Because it focuses on recall, it checks whether the important parts of the reference text are
present in the generated text, even if the wording is not identical.
Better suited for summarization, to see whether the important information is preserved in the
summarized text.
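A minimal sketch of both metrics using the nltk and rouge_score packages (the sentences are made up; a real evaluation would use a full test set):

from nltk.translate.bleu_score import sentence_bleu
from rouge_score import rouge_scorer

reference = "the cat sat on the mat"
generated = "the cat sat on a mat"

# BLEU: n-gram precision of the generated text against the (tokenized) reference.
bleu = sentence_bleu([reference.split()], generated.split())

# ROUGE: recall-oriented n-gram overlap (ROUGE-1 and ROUGE-L shown here).
scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
rouge = scorer.score(reference, generated)

print("BLEU:", bleu)
print("ROUGE:", rouge)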
More Concepts
LoRA
LoRA (Low-Rank Adaptation) is a parameter-efficient technique used for fine-tuning a model.
One way to fine-tune is to train the full model, which is full-parameter fine-tuning. Another way
is domain fine-tuning, for a specific domain such as finance or education. Another is task-specific
fine-tuning, e.g. a Q&A chatbot or text-to-SQL. LoRA can be used in each of these settings, and it
also has variants of its own that apply the idea in different ways.
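LoRA works by freezing the base model and training small low-rank adapter matrices on top of it. A minimal configuration sketch with the peft library; gpt2 and the c_attn target module are just a small example, and the right target modules depend on the base model's architecture:

from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")  # small example base model

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["c_attn"],  # attention projection layer in GPT-2
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the small adapter matrices are trainable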
DoRA
Here the weights are still trained, but the weight-update matrix is decomposed into lower-rank
matrices: suppose a 3x3 matrix is decomposed into a 3x1 and a 1x3 matrix. So when we train the
model, the weight updates are stored as these smaller matrices, requiring far fewer parameters to
represent the same update. This eases resource constraints.
When the weights are decomposed this way, the number of trainable parameters drops a lot,
which also makes the model lighter to train.
Quantization
We can also convert the 32-bit weights into 8-bit or even 4-bit and then use the model.
This is what we call quantization. When we quantize the model, we can run inference more
quickly.
For fp32:
1 bit for the sign, 8 bits for the exponent, and the remaining 23 bits for the mantissa.
For fp16:
1 bit for the sign, 5 bits for the exponent, and the remaining 10 bits for the mantissa.
Suppose you want to convert a matrix whose values range from 0 to 1000 and are stored as
fp32, and you want to turn them into unsigned int8, i.e. into the 0-255 range.
Asymmetric Quantization
[-20.0 ... 1000.0] to [0 ... 255]
To do the conversion we use min-max scaling: scale = (1000 - (-20)) / (255 - 0) = 1020 / 255 = 4.0,
which is the scale factor. But when we convert -20, we get -20 / 4 = -5, and we cannot store -5
when our 8-bit range starts at 0. So we add +5 to -5 to make it 0; this +5, which maps the
minimum value to 0, is the zero point.
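The same worked example in code, as a NumPy sketch of asymmetric min-max quantization:

import numpy as np

x = np.array([-20.0, 0.0, 500.0, 1000.0], dtype=np.float32)

# Map the float range [-20, 1000] onto the unsigned 8-bit range [0, 255].
x_min, x_max = -20.0, 1000.0
q_min, q_max = 0, 255

scale = (x_max - x_min) / (q_max - q_min)   # (1000 - (-20)) / 255 = 4.0
zero_point = round(q_min - x_min / scale)   # -(-20 / 4.0) = 5

q = np.clip(np.round(x / scale) + zero_point, q_min, q_max).astype(np.uint8)
x_dequant = (q.astype(np.float32) - zero_point) * scale

print("scale:", scale, "zero point:", zero_point)
print("quantized:", q)            # [  0   5 130 255]
print("dequantized:", x_dequant)  # [ -20.    0.  500. 1000.]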
Modes of Quantization.
1. Post-Training Quantization (PTQ)
We take a pre-trained model --> perform calibration --> obtain a quantized model --> use it for
any use case.
This may cause some loss of accuracy.
QLoRA
Thus we can quantize the model used with LoRA as well, to lower precision such as 4-bit or 8-bit
from higher precision. This enables us to fine-tune and use LLMs with less GPU power, for example
on Colab.
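A minimal QLoRA-style loading sketch with transformers, bitsandbytes, and peft. The model name is only an example, and this assumes a CUDA GPU with bitsandbytes installed:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, TaskType, get_peft_model

# Load the base model with its weights quantized to 4-bit (NF4).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

base_model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",   # example model; replace with the model you fine-tune
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach small LoRA adapters on top of the frozen 4-bit weights.
lora_config = LoraConfig(task_type=TaskType.CAUSAL_LM, r=16, lora_alpha=32, lora_dropout=0.05)
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()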