Acknowledgements
llama-crab would not exist without the work of the people and
projects listed below. Thank you.
The foundation
- llama.cpp — Georgi Gerganov and the ggml-org team. The C/C++ inference engine that llama-crab wraps. llama.cpp is the reason we can run large language models on consumer hardware.
- GGML — the tensor library that powers every backend.
The Rust ecosystem
llama-crab stands on the shoulders of a long list of Rust
projects. Highlights:
- bindgen — auto-generation of the FFI bindings in llama-crab-sys.
- cmake and cc — the C/C++ build glue.
- serde and serde_json — request/response types, tool definitions, the JSON-Schema converter.
- anyhow and thiserror — error handling.
- tokio and axum — the HTTP server.
- tracing and tracing-subscriber — structured logging.
- sled — the on-disk prompt cache.
A full list lives in the workspace Cargo.lock.
The models
The examples in this repository are tested against open-weights models from the Hugging Face Hub. Thank you to:
- Alibaba (Qwen team) — Qwen 2 / 2.5.
- Meta (Llama team) — Llama 3 / 3.1 / 3.2 / 3.3.
- Google (Gemma team) — Gemma 2 / 3 / 4.
- Mistral AI — Mistral and Mixtral.
- Microsoft (Phi team) — Phi-3.
- DeepSeek AI — DeepSeek-V2 / V2.5.
- Liquid AI — LFM2.5-VL.
- Beijing Academy of Artificial Intelligence (BGE team) — BGE embeddings and rerankers.
- Cohere — Command R / R+.
The tools
- Material for MkDocs — the documentation site theme.
- Pymdown Extensions — the Markdown extensions used by the docs.
- mdBook — the previous documentation tool. Thank you for the years of service.
The community
Thanks to every contributor who has filed an issue, sent a PR, or helped someone in the discussions. The full list lives in the contributors graph.
Where to next?
- License — the full text.
- Contributing — how to send a fix for a bug you found.
- Home — back to the documentation home.