
Generating your first text

Generative AI
Marlon S. Viñán Ludeña
Traditional Machine Learning
[Diagram: the traditional machine learning training workflow]
Training LLMs
1. Language modeling: The first step, called pretraining, takes
the majority of the computation and training time. An LLM is
trained on a vast corpus of internet text, allowing the model to
learn grammar, context, and language patterns. The resulting
model is often referred to as a foundation model or base
model. These models generally do not follow instructions.
2. Fine-tuning: The second step, fine-tuning (sometimes called
post-training), takes the previously trained model and trains it
further on a narrower task.
Large Language Models Applications
1. Detecting whether a review left by a customer is positive or negative: This is (supervised)
classification and can be handled with both encoder- and decoder-only models, either with pretrained
models or by fine-tuning them.
2. Developing a system for finding common topics in ticket issues: This is (unsupervised)
classification, for which we have no predefined labels. We can leverage encoder-only models to perform
the classification itself and decoder-only models for labeling the topics.
3. Building a system for retrieval and inspection of relevant documents: A major component of
language model systems is their ability to draw on external sources of information. Using semantic search,
we can build systems that allow us to easily access and find information for an LLM to use.
4. Constructing an LLM chatbot that can leverage external resources, such as tools and
documents: This is a combination of techniques that demonstrates how the true power of LLMs can be
found through additional components and methods such as prompt engineering, retrieval-augmented
generation, and fine-tuning.
5. Constructing an LLM capable of writing recipes based on a picture showing the products in
your fridge: This is a multimodal task, where the LLM takes in an image and reasons about what it sees.
Responsible LLM development and Usage
1. Bias and fairness: LLMs are trained on large amounts of data that might contain
biases.
2. Transparency and accountability: Because of LLMs’ incredible capabilities, it is not
always clear whether you are talking with a human or an LLM. As such, using LLMs to
interact with humans can have unintended consequences when there is no human in
the loop.
3. Generating harmful content: LLMs can be used to generate fake news, articles,
and other misleading sources of information.
4. Intellectual property: When the output is similar to a phrase in the training data,
does the intellectual property belong to the author of that phrase? Without access to
the training data, it remains unclear when copyrighted material is being used by the
LLM.
5. Regulation: for example, the European AI Act.
Proprietary, Private Models
Open Models
Open source frameworks
● llama.cpp
● LangChain
● Hugging Face Transformers
● LlamaIndex
Generating Your First Text
Model: Phi-3 mini

Hardware: runs on less than 8 GB of VRAM

License: MIT

When you use an LLM, two models are loaded:

1. The generative model itself
2. Its underlying tokenizer
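
As a minimal sketch of what this looks like with Hugging Face Transformers (assuming the microsoft/Phi-3-mini-4k-instruct checkpoint and the accelerate package for device placement):

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "microsoft/Phi-3-mini-4k-instruct"

# 1. The generative model itself
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",   # place the weights on GPU/CPU automatically
    torch_dtype="auto",  # reuse the dtype stored in the checkpoint
)

# 2. Its underlying tokenizer, which converts text to token IDs and back
tokenizer = AutoTokenizer.from_pretrained(model_name)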
Notes: Transformers.pipeline

return_full_text: By setting this to False, only the model's output is
returned, not the prompt.

max_new_tokens: The maximum number of tokens the model will generate.
By setting a limit, we prevent long and unwieldy output, as some models
might otherwise continue generating until they reach their context window.

do_sample: Whether the model uses a sampling strategy to choose the next
token. By setting this to False, the model will always select the most
probable next token (greedy decoding).
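
Put together, these parameters might be passed to a text-generation pipeline as in the sketch below, which assumes the model and tokenizer loaded earlier; recent versions of transformers also accept chat-style message lists like this and apply the model's chat template automatically:

from transformers import pipeline

# Build a text-generation pipeline from the model and tokenizer above.
generator = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    return_full_text=False,  # return only the generated text, not the prompt
    max_new_tokens=500,      # cap how many tokens may be generated
    do_sample=False,         # greedy decoding: always pick the most probable token
)

messages = [{"role": "user", "content": "Create a funny joke about chickens."}]
output = generator(messages)
print(output[0]["generated_text"])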
Challenge
Decoding the Language: Exploring Prompts and Parameters in LLMs

Learning Objectives:
● Students will understand the basic functionality of a pre-trained Large
Language Model (LLM).
● Students will explore how different prompts affect LLM outputs.
Decoding the Language: Exploring Prompts and Parameters in LLMs

Materials:
● Access to the provided Google Colab notebook:
https://colab.research.google.com/drive/1toOYoeLnFAaW0OWZ-bt4jixKzbrjlqg-?usp=sharing
● Internet connection.
● A text editor or document for recording observations.
Decoding the Language: Exploring Prompts and Parameters in LLMs
Phase 1: Prompt Engineering Exploration

1. Introduction and Exploration:


○ Students begin by running the provided Colab notebook, ensuring they understand the basic code structure.
○ They should focus on the section where prompts are defined and the LLM response is generated.
○ Students are asked to run the default prompt and observe the output.
2. Prompt Modification:
○ Students are tasked with modifying the provided prompt in three distinct ways:
■ Specificity: Make the prompt more specific (e.g., instead of "Tell me a story," try "Tell me a short
science fiction story about a robot on Mars.").
■ Role-Playing: Instruct the LLM to adopt a specific persona (e.g., "Act as a Shakespearean playwright
and write a short monologue.").
■ Constraint: Add constraints to the output (e.g., "Write a poem that is exactly four lines long.").
○ For each modification, students should (a runnable sketch of the three variants follows this section):
■ Record the modified prompt.
■ Run the code and record the LLM's output.
■ Document their observations: How did the output change? Did the LLM adhere to the modifications?
3. Analysis:
○ Students should write a brief reflection on the impact of prompt engineering. What makes a "good" prompt?
How can prompts be used to guide the LLM's behavior?
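
As a starting point for recording outputs, the three prompt variants could be run in one loop, reusing the generator pipeline defined earlier; the prompts are the illustrative examples from the instructions above, not required wording:

# Illustrative Phase 1 prompt variants; substitute your own modifications.
prompts = {
    "specificity": "Tell me a short science fiction story about a robot on Mars.",
    "role-playing": "Act as a Shakespearean playwright and write a short monologue.",
    "constraint": "Write a poem that is exactly four lines long.",
}

for label, prompt in prompts.items():
    output = generator([{"role": "user", "content": prompt}])
    print(f"--- {label} ---")
    print(output[0]["generated_text"])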
Decoding the Language: Exploring Prompts and Parameters in LLMs

Phase 2: Creative Experimentation


1. Design a prompt that generates a short story of at least 100 words.
2. Modify the prompt to control the story’s tone (e.g., make it humorous,
dramatic, or scientific).
3. Change the configuration parameters to investigate their influence on the
output. Setting do_sample = True activates sampling-based decoding
strategies, which choose the next token from the probability distribution
over the entire vocabulary (see the sketch after this list).
4. Compare the responses and describe how prompt wording affects the
model’s output.
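
A minimal sketch of such a parameter experiment, reusing the generator pipeline from earlier; the temperature and top_p values are illustrative choices, not values prescribed by the lab:

# Generation kwargs passed at call time are forwarded to model.generate()
# and override the defaults the pipeline was built with.
output = generator(
    [{"role": "user", "content": "Tell me a humorous story of at least 100 words."}],
    do_sample=True,   # sample instead of always taking the most probable token
    temperature=0.9,  # higher values flatten the distribution (more randomness)
    top_p=0.95,       # nucleus sampling: keep only the smallest token set whose
                      # cumulative probability exceeds 0.95
)
print(output[0]["generated_text"])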
Decoding the Language: Exploring Prompts and Parameters in LLMs

Deliverable:
At the end of the lab, submit a brief report (max 2 pages) including:
● Answers to all questions.
● Screenshots of key results from the Colab notebook.
● Reflections on what you learned about LLMs from the experiments.

Due date: March 19, until 11:59 p.m.
