API Reference#

Detailed documentation for the public Go APIs.

  • Generate – text generation, streaming, and sampling options
  • Inference – model loading, GGUF parsing, architecture builders
  • Serve – OpenAI-compatible HTTP server and middleware