System requirements
| Requirement | Version |
|---|---|
| Python | >=3.11, <3.12 (exactly Python 3.11) |
| GPU | CUDA-capable (required for Triton kernels) |
| Package manager | uv |
Step-by-step installation
Install uv
miniVLLM uses Restart your shell or run
uv for reproducible, isolated dependency management. Install it with the official installer:source $HOME/.local/bin/env to make the uv command available.Sync dependencies
uv sync reads pyproject.toml, resolves the dependency graph, and installs everything into a project-local virtual environment (.venv/). You do not need to create or activate a virtualenv manually.To also install the optional development dependencies:Dependencies explained
The core dependencies declared inpyproject.toml are:
| Package | Purpose |
|---|---|
torch | GPU tensor operations and the base for all model compute |
transformers | Model tokenizers and config loading (e.g. AutoTokenizer) |
xxhash | Fast hashing used by the block manager for KV cache prefix matching |
vllm>=0.15.0 | Provides reference kernels and utilities that miniVLLM builds on |
Optional dev dependencies
Install these when contributing to or testing the project:| Package | Purpose |
|---|---|
pytest>=7.0 | Test runner |
black>=23.0 | Code formatter |
isort>=5.0 | Import sorter |
uv sync vs pip install
miniVLLM is designed to be used with
uv sync, not pip install. The uv run prefix ensures commands run inside the managed virtual environment without needing to activate it manually. If you prefer a traditional workflow, you can activate .venv/ with source .venv/bin/activate and then run commands directly.pip install -r requirements.txt — there is no requirements.txt. All dependencies are declared in pyproject.toml and managed exclusively through uv.
Troubleshooting
CUDA not found / no CUDA-capable device
CUDA not found / no CUDA-capable device
miniVLLM requires a CUDA GPU. If you see an error like If this prints
AssertionError: CUDA is not available, check:- Your machine has an NVIDIA GPU.
- The CUDA toolkit is installed and on your
PATH. Runnvidia-smito confirm. - Your
torchinstallation includes CUDA support. Run:
False, reinstall PyTorch with the correct CUDA version from pytorch.org.Wrong Python version
Wrong Python version
If You need Python 3.11 specifically. Install it via your system package manager or python.org, then tell
uv sync reports a Python version conflict, check your active Python version:uv to use it:uv command not found after installation
uv command not found after installation
The Add this line to your
uv installer adds itself to ~/.local/bin/. If your shell does not pick it up, add it to your PATH:~/.bashrc or ~/.zshrc to make it permanent.ImportError: No module named 'myvllm'
ImportError: No module named 'myvllm'
miniVLLM’s source lives under Or, if using an activated virtualenv, add
src/. When running scripts directly with python instead of uv run python, the src/ directory may not be on your PYTHONPATH. Always use:src/ to your path manually: