Skip to content

ZigLlama

Tags

Initializing search

cognisoc/zigllm

Home
Getting Started
Architecture
Layer 1: Foundations
Layer 2: Linear Algebra
Layer 3: Neural Primitives
Layer 4: Transformers
Layer 5: Models
Layer 6: Inference
Tools & Server
API Reference
Examples & Tutorials
Performance
References

ZigLlama

cognisoc/zigllm

Home
Getting Started
Getting Started
Architecture
Architecture
Layer 1: Foundations
Layer 1: Foundations
Layer 2: Linear Algebra
Layer 2: Linear Algebra
Layer 3: Neural Primitives
Layer 3: Neural Primitives
Layer 4: Transformers
Layer 4: Transformers
Layer 5: Models
Layer 5: Models
- Model Configuration
- Tokenization
- Chat Templates
- GGUF Model Loading
- LLaMA / LLaMA 2
- Mistral
- GPT-2
- Falcon
- Qwen
- Phi
- GPT-J
- GPT-NeoX
- BLOOM
- Mamba
- BERT
- Gemma
- StarCoder
- Mixture of Experts
- Multi-Modal
Layer 6: Inference
Layer 6: Inference
Tools & Server
Tools & Server
API Reference
API Reference
Examples & Tutorials
Examples & Tutorials
Performance
Performance
References
References

Tags¶

Copyright 2024–2026 ZigLlama Contributors — MIT License

Made with Material for MkDocs