llm
A bare-metal C unikernel for serving large language models -- no OS, no overhead.
6 projects · updated 2026-06-06T21:31:50Z · @cognisoc on GitHub
A bare-metal C unikernel for serving large language models -- no OS, no overhead.
Run AI models directly on mobile devices. No cloud. No latency. Complete privacy.
Run local GGUF language models from .NET -- one package, one format, one programming model.
Run any LLM locally. Use it from any language. Deploy anywhere.
A modular LLM inference runtime written in Rust.
Learn how LLMs work by building one in Zig -- from tensors to text generation.