a small gpt-2 implementation in apple's mlx. the script loads gpt-2 weights through hugging face transformers, maps them into the mlx model, and runs cached text generation with temperature and top-k sampling.
uv run model.py| Name | Name | Last commit date | ||
|---|---|---|---|---|