Skip to content

Add support for vLLM KV-cache quantization #315

Add support for vLLM KV-cache quantization

Add support for vLLM KV-cache quantization #315

Workflow file for this run

name: Slow end to end tests
on:
push:
branches:
- main
- v*-release
pull_request:
branches:
- main
jobs:
run_tests:
name: Run tests
runs-on: 'aws-g4dn-2xlarge-use1-public-80'
steps:
- name: Install Git LFS
run: |
if ! command -v git-lfs &> /dev/null; then
echo "Installing Git LFS..."
sudo apt-get update && sudo apt-get install -y git-lfs
git lfs install
else
echo "Git LFS already installed."
fi
- name: Checkout repository
uses: actions/checkout@v4
with:
lfs: true
- name: Install uv
uses: astral-sh/setup-uv@v5
with:
enable-cache: true
- name: Install the project
run: uv sync --extra dev
- name: Run tests
run: uv run pytest --disable-pytest-warnings -o log_cli=true -o log_cli_level=INFO --runslow tests/slow_tests/