`yek`

A fast Rust based tool to read text-based files in a repository or directory, chunk them, and serialize them for LLM consumption. By default, the tool:

Uses .gitignore rules to skip unwanted files.
Uses the Git history to infer what files are important.
Infers additional ignore patterns (binary, large, etc.).
Splits content into chunks based on either approximate "token" count or byte size.
Automatically detects if output is being piped and streams content instead of writing to files.
Supports processing multiple directories in a single command.
Configurable via a yek.toml file.

Yek (يک) means "One" in Farsi/Persian.

Consider having a simple repo like this:

.
├── README.md
├── src
│   ├── main.rs
│   └── utils.rs
└── tests
    └── test.rs

Running yek in this directory will produce a single file and write it to the temp directory with the following content:

>>>> README.md
... content of README.md ...
>>>> tests/test.rs
... content of tests/test.rs ...
>>>> src/utils.rs
... content of src/utils.rs ...
>>>> src/main.rs
... rest of the file ...

yek will prioritize more important files to come last in the output. This is useful for LLM consumption.

Installation

Via Homebrew (recommended for macOS)

brew tap bodo-run/yek https://github.com/bodo-run/yek.git
brew install yek

Via Install Script

For Unix-like systems (macOS, Linux):

curl -fsSL https://bodo.run/yek.sh | bash

For Windows (PowerShell):

irm https://bodo.run/yek.ps1 | iex

From Source

Install Rust.
Clone this repository.
Run make macos or make linux to build for your platform (both run cargo build --release).
Add to your PATH:

export PATH=$(pwd)/target/release:$PATH

Usage

yek has sensible defaults, you can simply run yek in a directory to serialize the entire repository. It will serialize all files in the repository into chunks of 10MB by default. The file will be written to the temp directory and file path will be printed to the console.

Examples

Process current directory and write to temp directory:

yek

Pipe output to clipboard (macOS):

yek src/ | pbcopy

Cap the max size to 128K tokens and only process the src directory:

yek --max-size 128K --tokens src/

Cap the max size to 100KB and only process the src directory, writing to a specific directory:

yek --max-size 100KB --output-dir /tmp/yek src/

Process multiple directories:

yek src/ tests/

Process multiple repositories:

yek ~/code/project1 ~/code/project2

Help

yek --help

Repository content chunker and serializer for LLM consumption

Usage: yek [OPTIONS] [directories]...

Arguments:
  [directories]...  Directories to process [default: .]

Options:
      --max-size <max-size>      Maximum size per chunk (e.g. '10MB', '128KB', '1GB') [default: 10MB]
      --tokens                   Count size in tokens instead of bytes
      --debug                    Enable debug output
      --output-dir <output-dir>  Output directory for chunks
  -h, --help                     Print help
  -V, --version                  Print version

Configuration File

You can place a file called yek.toml at your project root or pass a custom path via --config. The configuration file allows you to:

Add custom ignore patterns
Define file priority rules for processing order
Add additional binary file extensions to ignore (extends the built-in list)
Configure Git-based priority boost

Example `yek.toml`

This is optional, you can configure the yek.toml file at the root of your project.

# Add patterns to ignore (in addition to .gitignore)
[ignore_patterns]
patterns = [
  "node_modules/",
  "\\.next/",
  "my_custom_folder/"
]

# Configure Git-based priority boost (optional)
git_boost_max = 50  # Maximum score boost based on Git history (default: 100)

# Define priority rules for processing order
# Higher scores are processed first
[[priority_rules]]
score = 100
patterns = ["^src/lib/"]

[[priority_rules]]
score = 90
patterns = ["^src/"]

[[priority_rules]]
score = 80
patterns = ["^docs/"]

# Add additional binary file extensions to ignore
# These extend the built-in list (.jpg, .png, .exe, etc.)
binary_extensions = [
  ".blend",  # Blender files
  ".fbx",    # 3D model files
  ".max",    # 3ds Max files
  ".psd",    # Photoshop files
]

All configuration keys are optional. By default:

No extra ignore patterns
All files have equal priority (score: 1)
Git-based priority boost maximum is 100
Common binary file extensions are ignored (.jpg, .png, .exe, etc. - see source for full list)

Planned Features

Be smarter about finding out test files

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
.cargo		.cargo
.github/workflows		.github/workflows
Formula		Formula
benches		benches
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
.releaserc		.releaserc
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
semantic-release.toml		semantic-release.toml
yek.toml		yek.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

`yek`

Installation

Via Homebrew (recommended for macOS)

Via Install Script

From Source

Usage

Examples

Help

Configuration File

Example `yek.toml`

Planned Features

License

About

Uh oh!

Releases

Packages

Languages

License

Elias070/yek

Folders and files

Latest commit

History

Repository files navigation

yek

Installation

Via Homebrew (recommended for macOS)

Via Install Script

From Source

Usage

Examples

Help

Configuration File

Example yek.toml

Planned Features

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`yek`

Example `yek.toml`

Packages