
llama.cpp

This document walks through compiling and installing the CLBlast and llama.cpp libraries to run large language models over OpenCL: cloning the repositories, building both projects, setting the required environment variables, and running models on Intel GPUs while measuring load and generation times.

git clone https://github.com/CNugteren/CLBlast.git
cd CLBlast
rm -rf build
mkdir build
cd build
cmake .. \
  -DOPENCL_INCLUDE_DIRS=/opt/intel/oneapi/compiler/latest/linux/include/sycl \
  -DOPENCL_LIBRARIES=/opt/intel/oneapi/compiler/latest/linux/lib/libOpenCL.so \
  -DCMAKE_EXE_LINKER_FLAGS="-Wl,-rpath,/opt/intel/oneapi/compiler/latest/linux/compiler/lib/intel64_lin" \
  -DCMAKE_INSTALL_PREFIX=./Release \
  -DBUILD_SHARED_LIBS=OFF \
  -DTUNERS=OFF
make
make install

cd ~/wrkdir

git clone https://github.com/ggerganov/llama.cpp.git

mkdir llama.cpp/build
cd llama.cpp/build
export LD_LIBRARY_PATH=/opt/intel/oneapi/compiler/latest/linux/include/:$LD_LIBRARY_PATH

cmake .. -DLLAMA_CLBLAST=ON \
  -DCLBlast_DIR=/home/imran/wrkdir/CLBlast/build/Release/lib/cmake/CLBlast \
  -DCMAKE_EXE_LINKER_FLAGS="-Wl,-rpath,/opt/intel/oneapi/compiler/latest/linux/compiler/lib/intel64_lin"
cmake --build . --config Release

export LD_LIBRARY_PATH=/opt/intel/oneapi/compiler/latest/linux/compiler/lib/intel64_lin:$LD_LIBRARY_PATH
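A quick way to confirm the built binary actually resolved the Intel OpenCL runtime is an ldd check. This is a sketch; the ./bin/main path is an assumption based on the cmake build layout above:

```shell
# check_links BIN PATTERN: succeed if BIN dynamically links a library matching PATTERN
check_links() { ldd "$1" 2>/dev/null | grep -qi "$2"; }

# path assumed from the llama.cpp build tree above
check_links ./bin/main libOpenCL && echo "OpenCL linked" || echo "OpenCL not linked"
```

If this prints "OpenCL not linked", re-check LD_LIBRARY_PATH and the rpath passed to the linker.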

(base) imran@flex-r3-u44:~/wrkdir/llama.cpp/build/bin$ clinfo -l
Platform #0: Intel(R) OpenCL Graphics
 +-- Device #0: Intel(R) Data Center GPU Flex 170
 `-- Device #1: Intel(R) Data Center GPU Flex 170

Download llama-2-13b.ggmlv3.q4_0.bin from https://huggingface.co/TheBloke/Llama-2-13B-GGML/tree/main
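One non-interactive way to fetch the file, assuming Hugging Face's usual resolve/main direct-download URL pattern for that repository:

```shell
REPO=https://huggingface.co/TheBloke/Llama-2-13B-GGML
FILE=llama-2-13b.ggmlv3.q4_0.bin
URL="$REPO/resolve/main/$FILE"
echo "$URL"
# wget -c "$URL" -O ~/wrkdir/"$FILE"   # -c resumes a partial download
```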

GGML_OPENCL_PLATFORM="Intel(R) OpenCL Graphics" GGML_OPENCL_DEVICE=0 ./main \
  -m ~/wrkdir/llama-2-13b.ggmlv3.q4_0.bin -n 128 -ngl 40 \
  -p "Building a website can be done in 10 simple steps:"

GGML_OPENCL_PLATFORM="Intel(R) OpenCL Graphics" GGML_OPENCL_DEVICE=1 ./main \
  -m ~/wrkdir/llama-2-13b.ggmlv3.q4_0.bin -n 128 -ngl 40 \
  -p "Building a website can be done in 10 simple steps:"

GGML_OPENCL_PLATFORM="Intel(R) OpenCL Graphics" GGML_OPENCL_DEVICE="#1" ./main \
  -m ~/wrkdir/llama-2-13b.ggmlv3.q4_0.bin -n 128 -ngl 40 \
  -p "Building a website can be done in 10 simple steps:"

GGML_OPENCL_PLATFORM="Intel(R) OpenCL Graphics" GGML_OPENCL_DEVICE=0 ./main \
  -m ~/wrkdir/llama-2-7b.ggmlv3.q4_0.bin -n 128 -ngl 40 \
  -p "Building a website can be done in 10 simple steps:"

GGML_OPENCL_PLATFORM="Intel(R) OpenCL Graphics" GGML_OPENCL_DEVICE=0,1 ./main \
  --main-gpu 0 --tensor-split 30,30 \
  -m ~/wrkdir/llama2-22b-daydreamer-v2.ggmlv3.q8_0.bin -n 128 -ngl 40 \
  -p "Building a website can be done in 10 simple steps:"

(base) hcp@scrappie1:~/projects/wizard/llama.cpp$ ./main -ngl 80 --main-gpu 0 \
  --tensor-split 0,12,12,12,12,12,12,8 \
  -m ~/Desktop/models/alpaca-lora-65B.ggmlv3.q4_0.bin \
  -p "Instruction: Make a list of 10 imaginary fruits with a description including shape, color and taste. List: "
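--tensor-split takes relative weights rather than absolute layer counts. A small awk one-liner (purely illustrative, not part of llama.cpp) shows the fraction of tensors each GPU receives under the 8-GPU split above:

```shell
split="0,12,12,12,12,12,12,8"
echo "$split" | awk -F',' '{
  t = 0
  for (i = 1; i <= NF; i++) t += $i            # total weight (here: 80)
  for (i = 1; i <= NF; i++)                    # per-GPU share
    printf "GPU%d: %.1f%%\n", i-1, 100*$i/t
}'
# GPU0 gets nothing; GPU1-6 get 15% each; GPU7 gets 10%
```

The leading 0 keeps GPU0 free, which can be useful when it also drives a display or hosts other work.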

load time: time taken to load the model into memory.
sample time: time spent sampling, i.e. picking each next token from the model's output distribution (not, strictly speaking, tokenizing the prompt).
prompt eval time: time taken to process the tokenized prompt. Without this step the model would have no context from which to predict the next token.
eval time: time taken to generate all tokens of the response (excludes all pre-processing; it only measures from the point the model starts emitting tokens).
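These figures appear at the end of each run in a llama_print_timings block. A sketch of pulling a single number out of that block with awk; the sample values below are invented, only the line shape follows what main prints:

```shell
# illustrative timing block in the shape main prints on exit (values invented)
timings='llama_print_timings:        load time =  1234.56 ms
llama_print_timings:      sample time =    45.67 ms /   128 runs
llama_print_timings: prompt eval time =   789.01 ms /    14 tokens
llama_print_timings:        eval time =  5678.90 ms /   127 runs'

# extract the generation (eval) time in ms, skipping the prompt-eval line
echo "$timings" | awk -F'=' '/ eval time/ && !/prompt/ { print $2 }' | awk '{ print $1 }'
# prints 5678.90
```

The same filter applied to a real run (e.g. `./main ... 2>&1 | awk ...`) makes it easy to tabulate eval times across the device/split combinations tried above.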

If CMake cannot locate CLBlast's headers or its static library, the installed CLBlastConfig.cmake (under Release/lib/cmake/CLBlast) can point to them explicitly:

set(CLBlast_INCLUDE_DIRS "${CMAKE_CURRENT_LIST_DIR}/../../../include")
set(CLBlast_LIBRARIES "${CMAKE_CURRENT_LIST_DIR}/../../../lib/libclblast.a")
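One way to apply those two lines is to append them to the installed package config file; the path below assumes the CLBlast install prefix used earlier:

```shell
# append the two set() lines to the installed CMake package config
CFG_DIR="$HOME/wrkdir/CLBlast/build/Release/lib/cmake/CLBlast"
mkdir -p "$CFG_DIR"   # no-op if the install step already created it
cat >> "$CFG_DIR/CLBlastConfig.cmake" <<'EOF'
set(CLBlast_INCLUDE_DIRS "${CMAKE_CURRENT_LIST_DIR}/../../../include")
set(CLBlast_LIBRARIES "${CMAKE_CURRENT_LIST_DIR}/../../../lib/libclblast.a")
EOF
```

The `${CMAKE_CURRENT_LIST_DIR}/../../..` paths resolve to the Release prefix, so the config keeps working if the tree is moved wholesale.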
