Shimmy is a 4.8MB single-binary that provides 100% OpenAI-compatible endpoints for GGUF models. Point your existing AI tools to Shimmy and they just work — locally, privately, and free.
If you’re reading this, that means you’ve successfully made it through 2025! Allow us to be the first to congratulate you — ...