Skip to content

Latest commit

 

History

History
23 lines (17 loc) · 580 Bytes

File metadata and controls

23 lines (17 loc) · 580 Bytes

Basic Example

What it does

The basic example demonstrates how to start a LLM and run its main methods: (de-)tokenization and text completion (both streaming and non-streaming)

How to Build

  1. Download and extract the LlamaLib release LlamaLib-vX.X.X.zip from the latest release
  2. Build with CMake:
cmake -B build -DLLAMALIB_DIR=<path_to_extracted_LlamaLib_release>
cmake --build build

How to Run

  1. Copy a model .gguf file as model.gguf inside the directory

  2. Run

./build/main