Drupal is a registered trademark of Dries Buytaert
cms 2.1.3 Update released for Drupal core (2.1.3)! drupal 10.5.11 Update released for Drupal core (10.5.11)! drupal 11.3.11 Update released for Drupal core (11.3.11)! drupal 11.2.13 Update released for Drupal core (11.2.13)! drupal 10.6.10 Update released for Drupal core (10.6.10)! cms 2.1.2 Update released for Drupal core (2.1.2)! drupal 11.1.10 Update released for Drupal core (11.1.10)! drupal 10.5.10 Update released for Drupal core (10.5.10)! drupal 10.4.10 Update released for Drupal core (10.4.10)! drupal 11.2.12 Update released for Drupal core (11.2.12)! drupal 11.3.10 Update released for Drupal core (11.3.10)! drupal 10.6.9 Update released for Drupal core (10.6.9)! drupal 10.6.8 Update released for Drupal core (10.6.8)! drupal 11.3.9 Update released for Drupal core (11.3.9)! drupal 11.3.8 Update released for Drupal core (11.3.8)! drupal 11.3.7 Update released for Drupal core (11.3.7)! drupal 11.2.11 Update released for Drupal core (11.2.11)! drupal 10.6.7 Update released for Drupal core (10.6.7)! drupal 10.5.9 Update released for Drupal core (10.5.9)! cms 2.1.1 Update released for Drupal core (2.1.1)!

This module integrates llama.cpp with the AI module for Drupal, enabling local and self-hosted AI inference without any external service or API key.
It connects to a running llama-server instance through its OpenAI-compatible /v1 HTTP API, which means it works with any model in GGUF format that llama.cpp supports.

Supported operations:
- Chat completions
- Embeddings

Features

- Auto-discovers available models from the server at /v1/models
- Caches the model list in Drupal State so the site stays functional if the server is temporarily offline
- No API key required — designed for local development and self-hosted deployments
- Works in DDEV and Docker environments via http://host.docker.internal

Additional Requirements

- AI module 1.2 or later
- A running llama-server (included in llama.cpp, default port 8080)

Activity

Total releases
1
First release
Jun 2026
Latest release
1 day ago
Release cadence
Stability
100% stable

Releases

Version Type Release date
1.0.0 Stable Jun 1, 2026