0% found this document useful (0 votes)

17 views11 pages

DeepSeek图解10页

The document discusses DeepSeek, a framework related to large language models (LLMs) and transformers, detailing its components, including pretraining, supervised fine-tuning, and reinforcement learning. It also introduces DeepSeek-R1, which emphasizes reasoning-oriented reinforcement learning and various model checkpoints. Additionally, it provides links for further reading on related topics and methodologies.

Uploaded by

vongolaprimo276918

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views11 pages

DeepSeek图解10页

Uploaded by

vongolaprimo276918

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

DeepSeek 10 PDF

1 DeepSeek . . . . . . . . . . . . . . . . . . . . . . 2
1.1 DeepSeek . . . . . . . . . . . . . . . . . 2
1.2 DeepSeek . . . . . . . . . . . . . . . . . . . 2
1.3 DeepSeek . . . . . . . . . . . . . . . . . . . 4

2 DeepSeek . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1 LLM . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Transformer . . . . . . . . . . . . . . . . . . . . . . 6
2.3 LLM . . . . . . . . . . . . . . . . . . . . . . . . 7
2.3.1 Pretraining . . . . . . . . . . . . . . . . . . 7
2.3.2 Supervised Fine-Tuning, SFT . . . . . . 7
2.3.3 Reinforcement Learning, RL . . . . . . . 7

3 DeepSeek-R1 . . . . . . . . . . . . . . . . . . . . . . . 7
3.1 DeepSeek-R1 . . . . . . . . . . . . . . . . . . . 7
3.1.1 1 R1-Zero . . . . . . . 8
3.1.2 2 . . . . . . . . . . . . . . . 8
3.2 R1-Zero . . . . . . . . . . . . . . 9
3.3 . . . . . . . . . . . . . . . . . . . . . . 10
3.4 DeepSeek-R1 . . . . . . . . . . . . . . . . . . . . . . . . 11

4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

1
1 DeepSeek

1.1 DeepSeek

DeepSeek
1.

2. Fine-tuning

DeepSeek

• DeepSeek R1

1.2 DeepSeek

DeepSeek
ollama ollama

Ollama
1
1:

ollama
10 2

2: Ollama

ollama pull deepseek-r1:1.5b deepseek-

r1 3

3: DeepSeek-r1
DeepSeek
cmd(Windows ) terminal(
) ollama run deepseek-r1:1.5b
4

4: Ollama deepseek-r1

1.3 DeepSeek

DeepSeek
Python ? think

5: deepseek-r1

think
6 :

6: deepseek-r1

2 DeepSeek
DeepSeek-R1 LLM

AI Large
Language Model, LLM LLM NLP

LLM
LLM

2.1 LLM

deepseek-r1:1.5b, qwen:7b, llama:8b

1.5b, 7b 8b b billion 7b 70 8b
80 70 80 weight+bias
Transformer Transformer
70 80
ImageNet 20News-
Group

Scaling Laws

Scaling Laws
Scaling Laws

Transformer Scaling Laws

Transformer

2.2 Transformer

LLM 2017 Google Transformer

RNN LSTM
Transformer 1.
Self-Attention
2. Multi-Head Attention
3.
FFN 4.
Positional Encoding

Transformer

1.
2.
3. AI
2.3 LLM

2.3.1 Pretraining

LLM 1.
2.
3.

2.3.2 Supervised Fine-Tuning, SFT

SFT

2.3.3 Reinforcement Learning, RL

RL RLHF,
Reinforcement Learning from Human Feedback

RLHF

• 1

• 2

• 3

3 DeepSeek-R1

3.1 DeepSeek-R1

DeepSeek-R1
AI RL SFT
AI
DeepSeek-V3
SFT +
7

7: R1

DeepSeek-R1 DeepSeek-v3-Base

3.1.1 1 R1-Zero

7 Reasoning-Oriented Reinforcement Learn-

ing Iterim reasoning model , 8

DeepSeek-R1
DeepSeek-R1-Zero
R1-Zero Chain-of-Thought,
CoT SFT 7 3.2

3.1.2 2

R1-Zero
DeepSeek

7 General Reinforcement Learning SFT-

checkpoint RL
3.3

3.2 R1-Zero

SFT 8
SFT

8: Interim reasoning model

DeepSeek
R1-Zero
R1-Zero SFT
9 V3

9: R1-Zero

OpenAI O1
10 pass@1 16
cons@16
OpenAI O1 DeepSeek-R1-Zero
OpenAI O1.

10: R1-Zero

3.3

Preference Tuning 11
R1

helpfulness safety Llama

DeepSeek-R1 R1-Zero
AI

11: R1
3.4 DeepSeek-R1

Reasoning-Oriented RL
CoT

DeepSeek-R1 R1-Zero

Reasoning-Oriented
RL CoT

DeepSeek-R1 R1-Zero

4
https://newsletter.languagemodels.co/p/the-illustrated-deepseek-r1
https://www.interconnects.ai/p/deepseek-r1-recipe-for-o1
https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mixture-of-
experts

Module 1 - S8 CSE NOTES - KTU DEEP LEARNING NOTES - CST414
100% (1)
Module 1 - S8 CSE NOTES - KTU DEEP LEARNING NOTES - CST414
18 pages
Module 4 - S8 CSE NOTES - KTU DEEP LEARNING NOTES - CST414
No ratings yet
Module 4 - S8 CSE NOTES - KTU DEEP LEARNING NOTES - CST414
21 pages
How DeepSeek-R1 Was Built - Architecture and Training Explained
No ratings yet
How DeepSeek-R1 Was Built - Architecture and Training Explained
12 pages
A Perfect Guide To DeepSeek R1
No ratings yet
A Perfect Guide To DeepSeek R1
9 pages
DeepSeek R1 Running Locally A Complete Setup Guide
No ratings yet
DeepSeek R1 Running Locally A Complete Setup Guide
18 pages
DeepSeek R1 Dual
No ratings yet
DeepSeek R1 Dual
44 pages
Drawing DeepSeek R1 Architecture and Training Process From Scratch - by Fareed Khan - Feb, 2025 - Level Up Coding
No ratings yet
Drawing DeepSeek R1 Architecture and Training Process From Scratch - by Fareed Khan - Feb, 2025 - Level Up Coding
39 pages
The DeepSeek Series A Technical Overview
No ratings yet
The DeepSeek Series A Technical Overview
11 pages
DeepSeek Modelss
No ratings yet
DeepSeek Modelss
52 pages
DeepSeek Models Explained
No ratings yet
DeepSeek Models Explained
11 pages
How I Built My Own AI Web Agent (And Saved Hundreds A Month!) - by Algo Insights - Coding Nexus - Apr, 2025 - Medium
No ratings yet
How I Built My Own AI Web Agent (And Saved Hundreds A Month!) - by Algo Insights - Coding Nexus - Apr, 2025 - Medium
15 pages
DeepSeek-R1: Enhanced Reasoning Via Reinforcement Learning
No ratings yet
DeepSeek-R1: Enhanced Reasoning Via Reinforcement Learning
9 pages
Learn About Deepseek
No ratings yet
Learn About Deepseek
8 pages
DeepSeek Proves AI Comes For All Jobs - Even AI Jobs
No ratings yet
DeepSeek Proves AI Comes For All Jobs - Even AI Jobs
6 pages
Course - A Deep Understanding of Deep Learning (With Python Intro)
No ratings yet
Course - A Deep Understanding of Deep Learning (With Python Intro)
4 pages
Morgan & Claypool - Introduction To Deep Learning For Engineers Using Python and Google Clod Platform - 2020
No ratings yet
Morgan & Claypool - Introduction To Deep Learning For Engineers Using Python and Google Clod Platform - 2020
111 pages
DeepSeek R1
No ratings yet
DeepSeek R1
22 pages
DeepSeek R1
No ratings yet
DeepSeek R1
22 pages
First Newsletter Cglug
No ratings yet
First Newsletter Cglug
3 pages
AI2
No ratings yet
AI2
1 page
Run DeepSeek Models Locally in 5 Minutes
No ratings yet
Run DeepSeek Models Locally in 5 Minutes
10 pages
AI Learning Resources
No ratings yet
AI Learning Resources
6 pages
Hands-On Guide Running DeepSeek LLMs Locally
No ratings yet
Hands-On Guide Running DeepSeek LLMs Locally
10 pages
Deepseek-R1: Incentivizing Reasoning Capability in Llms Via Reinforcement Learning
No ratings yet
Deepseek-R1: Incentivizing Reasoning Capability in Llms Via Reinforcement Learning
24 pages
DeepSeek R1
No ratings yet
DeepSeek R1
23 pages
Deep Seek R1
No ratings yet
Deep Seek R1
7 pages
DeepSeek Research
No ratings yet
DeepSeek Research
6 pages
Guía Práctica para Dominar DeepSeek R1
No ratings yet
Guía Práctica para Dominar DeepSeek R1
1 page
Nnet - Ug 1 150 PDF
No ratings yet
Nnet - Ug 1 150 PDF
150 pages
DeepSeek-Coder-v2 - The BEST Opensource Coding LLM! (Beats GPT-4o and Claude 3.5 Sonnet) (DownSub - Com)
No ratings yet
DeepSeek-Coder-v2 - The BEST Opensource Coding LLM! (Beats GPT-4o and Claude 3.5 Sonnet) (DownSub - Com)
14 pages
Deepseek R1 Opensource Ai Reasoning: Swipe
No ratings yet
Deepseek R1 Opensource Ai Reasoning: Swipe
13 pages
DeepSeek R1
No ratings yet
DeepSeek R1
2 pages
Certainly
No ratings yet
Certainly
3 pages
Report On DeepSeek-R1
No ratings yet
Report On DeepSeek-R1
12 pages
1.1. Contributions Post-Training: Large-Scale Reinforcement Learning On The Base Model
No ratings yet
1.1. Contributions Post-Training: Large-Scale Reinforcement Learning On The Base Model
1 page
How To Run Deepseek Locally
No ratings yet
How To Run Deepseek Locally
10 pages
SlideEgg 301128-DeepSeek
No ratings yet
SlideEgg 301128-DeepSeek
22 pages
Affan 1
No ratings yet
Affan 1
24 pages
A Technical Primer On Deepseek
No ratings yet
A Technical Primer On Deepseek
18 pages
RNN Part-2
No ratings yet
RNN Part-2
181 pages
LeeDL Tutorial v.1.1.1
No ratings yet
LeeDL Tutorial v.1.1.1
11 pages
Deep Learning Course File
No ratings yet
Deep Learning Course File
56 pages
DeepSeek Ai Research
No ratings yet
DeepSeek Ai Research
3 pages
SOW For Devs
No ratings yet
SOW For Devs
2 pages
Achine Learning Actionable Roadmap
No ratings yet
Achine Learning Actionable Roadmap
19 pages
PE - IV - 102047804 - Deep Learning and Applications
No ratings yet
PE - IV - 102047804 - Deep Learning and Applications
3 pages
OpenAI Deep Research
No ratings yet
OpenAI Deep Research
12 pages
Lecture 1a - Introduction
No ratings yet
Lecture 1a - Introduction
38 pages
Deep Learning With Tensor Ow and Google Cloud Ai 2-In-1
No ratings yet
Deep Learning With Tensor Ow and Google Cloud Ai 2-In-1
6 pages
DeepSeek pdf-1
No ratings yet
DeepSeek pdf-1
7 pages
CE0733 - Machine Learning and Deep Learning - Compulsory
No ratings yet
CE0733 - Machine Learning and Deep Learning - Compulsory
3 pages
The New King of AI Coding
No ratings yet
The New King of AI Coding
8 pages
AI Engineer Road Map 2024
No ratings yet
AI Engineer Road Map 2024
9 pages
Coursepack Deep Learningeven2024 - R1uc604c
No ratings yet
Coursepack Deep Learningeven2024 - R1uc604c
13 pages
IBM Data Science Certification-Merged
No ratings yet
IBM Data Science Certification-Merged
2 pages
How To Prompt DeepSeek The ChatGPT Killer 1738331990
No ratings yet
How To Prompt DeepSeek The ChatGPT Killer 1738331990
16 pages
Lecture 1
No ratings yet
Lecture 1
100 pages
Generative AI Tghjraining in Hyderabad
No ratings yet
Generative AI Tghjraining in Hyderabad
22 pages
Data Science Learning Path
No ratings yet
Data Science Learning Path
43 pages
RAG and LangChain
100% (1)
RAG and LangChain
14 pages
Pes Medium Ai Article05
No ratings yet
Pes Medium Ai Article05
4 pages
TensorFlow构建机器学习项目: Chinese Edition
From Everand
TensorFlow构建机器学习项目: Chinese Edition
Posts & Telecom Press
No ratings yet
Neural Language Model, RNNS: Pawan Goyal
No ratings yet
Neural Language Model, RNNS: Pawan Goyal
15 pages
CNN Building Blocks
No ratings yet
CNN Building Blocks
14 pages
Cheat Sheet For Exam
No ratings yet
Cheat Sheet For Exam
2 pages
Deep Learning Techniques For Geospatial Data Analysis: August 2020
No ratings yet
Deep Learning Techniques For Geospatial Data Analysis: August 2020
21 pages
Assignment B 3 Customer Churn Modeling
No ratings yet
Assignment B 3 Customer Churn Modeling
7 pages
ML Unit 2
No ratings yet
ML Unit 2
23 pages
Ist 407 Presentation
No ratings yet
Ist 407 Presentation
12 pages
Assignment 6
No ratings yet
Assignment 6
2 pages
Syllabus INTRODUCTION TO DEEP LEARNING
No ratings yet
Syllabus INTRODUCTION TO DEEP LEARNING
11 pages
Pseudo Label Final
No ratings yet
Pseudo Label Final
7 pages
Plotting Decision Regions - 1 - Mlxtend
No ratings yet
Plotting Decision Regions - 1 - Mlxtend
5 pages
An Introduction To Neural Networks: Instituto Tecgraf PUC-Rio Nome: Fernanda Duarte Orientador: Marcelo Gattass
No ratings yet
An Introduction To Neural Networks: Instituto Tecgraf PUC-Rio Nome: Fernanda Duarte Orientador: Marcelo Gattass
45 pages
Badjatiya 2017
No ratings yet
Badjatiya 2017
2 pages
Ch. 9: Introduction To Convolution Neural Networks (CNN) and Systems
No ratings yet
Ch. 9: Introduction To Convolution Neural Networks (CNN) and Systems
96 pages
AIML Lect5 Assignment ID3
No ratings yet
AIML Lect5 Assignment ID3
2 pages
Chap 7.2 Sequence Analysis Using RNN LSTM
No ratings yet
Chap 7.2 Sequence Analysis Using RNN LSTM
60 pages
ML Imp Ques 2
No ratings yet
ML Imp Ques 2
37 pages
DL Mod1.PDF Flashcards
No ratings yet
DL Mod1.PDF Flashcards
10 pages
Convolutional Neural Network in DIP
No ratings yet
Convolutional Neural Network in DIP
2 pages
Project Slide - Final
No ratings yet
Project Slide - Final
23 pages
CERN Deep Learning and Vision
No ratings yet
CERN Deep Learning and Vision
72 pages
Classroom Project Report Latex Template
No ratings yet
Classroom Project Report Latex Template
7 pages
UNIT-5-Modern Recurrent Neural Networks
No ratings yet
UNIT-5-Modern Recurrent Neural Networks
60 pages
Backpropagation Neural Network For XOR Problem Java Source Code
100% (2)
Backpropagation Neural Network For XOR Problem Java Source Code
7 pages
TEAM MEMBERS Noopur Sharma Vartika Singh Vivashwat Thakur
No ratings yet
TEAM MEMBERS Noopur Sharma Vartika Singh Vivashwat Thakur
13 pages
Unit IV V Deep Learning Material
No ratings yet
Unit IV V Deep Learning Material
32 pages
Introduction To Radial Basis Function Networks
No ratings yet
Introduction To Radial Basis Function Networks
45 pages
Unit 3 - Classification With Back Propagation
No ratings yet
Unit 3 - Classification With Back Propagation
20 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

DeepSeek图解10页

Uploaded by

DeepSeek图解10页

Uploaded by

DeepSeek 10 PDF

ollama pull deepseek-r1:1.5b deepseek-

deepseek-r1:1.5b, qwen:7b, llama:8b

Transformer Scaling Laws

LLM 2017 Google Transformer

2.3.2 Supervised Fine-Tuning, SFT

2.3.3 Reinforcement Learning, RL

7 Reasoning-Oriented Reinforcement Learn-

7 General Reinforcement Learning SFT-

8: Interim reasoning model

helpfulness safety Llama

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.