Getting Started¶

Welcome to UniLLM! This guide will help you get up and running with the inference engine.

Overview¶

UniLLM is a Rust-based LLM inference engine that supports 45+ model architectures. In this section, you'll learn how to:

Install UniLLM and its dependencies
Run your first model using the Ollama integration
Understand the basics of the API

Prerequisites¶

Before you begin, ensure you have:

Rust 1.70+ - Install Rust
Git - For cloning the repository
~4GB RAM - For running small models like TinyLlama

Quick Installation¶

# Clone the repository
git clone https://github.com/anthropics/unillm.git
cd unillm

# Build the project
cargo build --release

# Verify the build
cargo test

What's Next?¶

Detailed Installation

Platform-specific instructions and troubleshooting
Your First Model

Step-by-step tutorial to run inference

Quick Test¶

Run a quick test to verify everything is working:

# Run the test suite
cargo test --lib -p runtime

# You should see output like:
# running 166 tests
# ...
# test result: ok. 166 passed; 0 failed

Ollama Integration

The easiest way to get started is using the Ollama integration, which automatically downloads and manages models for you. See Your First Model for details.