Drupal is a registered trademark of Dries Buytaert

ai_audio_generator

1 site, no security coverage

Uses either Google TTS or an AI text-to-speech provider (e.g. Gemini, ElevenLabs, OpenAI) to generate long, natural-sounding audio files from a curated, view-mode-controlled rendering of your content item.

Features

  1. Provides a 'Save and generate audio' button next to Save, so content editors can trigger generation on demand after making a notable change.
  2. Runs as a batch process, chunking the text at natural pauses (between paragraphs where possible, otherwise between sentences) so that the audio stays smooth when the per-chunk clips are combined.
  3. Handles content of any length.
  4. Provides voice control through the options exposed by the AI provider (e.g. OpenAI, Gemini) or through Google TTS options, including language selection such as en-GB instead of en-US.
  5. Allows basic management of an SSML pronunciation dictionary when using Google TTS, or a basic phonetic find and replace for AI-generated voices that do not support SSML.
  6. Gives fine-grained control over the text being read via the view mode and template.
  7. Includes automated test coverage for reliability and stability. Security coverage opt-in will follow once a stable release is made.
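
The chunking described in feature 2 can be illustrated with a short sketch. This is not the module's actual code (the module is written for Drupal); it is a minimal Python illustration of the general idea, assuming paragraphs are separated by blank lines and sentences end in ".", "!" or "?". The names chunk_text, split_sentences and max_chars are hypothetical.

```python
import re


def split_sentences(paragraph):
    """Naive sentence split: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]


def chunk_text(text, max_chars=4000):
    """Greedily pack text into chunks of at most max_chars characters.

    Chunks break between paragraphs where possible. A paragraph that is
    itself longer than max_chars is broken between sentences instead.
    A single sentence longer than max_chars is kept whole, so such a
    chunk can exceed the limit (an assumption of this sketch).
    """
    chunks = []
    current = ""
    for paragraph in re.split(r"\n\s*\n", text.strip()):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Treat the paragraph as one unit if it fits, else fall back to sentences.
        if len(paragraph) <= max_chars:
            units = [paragraph]
        else:
            units = split_sentences(paragraph)
        separator = "\n\n"  # the first unit starts a new paragraph
        for unit in units:
            candidate = unit if not current else current + separator + unit
            if len(candidate) <= max_chars:
                current = candidate
            else:
                if current:
                    chunks.append(current)
                current = unit
            separator = " "  # later units are sentences of the same paragraph
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk would then be sent to the text-to-speech provider as one request, and the resulting clips joined in order. Because every chunk ends at a paragraph or sentence boundary, the joins fall on natural pauses.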

Post-Installation

See the README.md file (to be transferred here eventually).

Optionally, run composer require google/cloud-text-to-speech if you prefer to use Google TTS; it is cheaper at volume but sounds less natural.

Similar projects

  • https://www.drupal.org/project/tts - Does not handle batch processing, and generates audio on demand rather than leaving the editor in control. Offers more out-of-the-box rendering, whereas this module simply produces the audio media file. Does not give the view mode and template control needed to curate the text.
  • https://www.drupal.org/project/ai -> AI Automators - The Automators module can generate simple audio files but does not handle joining large chunks together. Its UI is quite different: it focuses on simple on-demand changes rather than giving the site editor control (which is important for managing costs on large reports), and it does not give the view mode and template control needed to curate the text.
  • https://www.drupal.org/project/elevenlabs - Uses AI Automators under the hood. However, it also provides an AI provider, so ElevenLabs can be set as the provider.
  • https://git.drupalcode.org/project/ai_audio_field - Relies on ffmpeg being installed to merge large files (ffmpeg is not available on many hosts) and ultimately uses the AI Automators UI.

Activity

Total releases: 4
First release: Apr 2026
Latest release: 1 month ago
Release cadence: 0 days
Stability: 0% stable

Releases

Version         Type          Release date
1.0.0-alpha3    Pre-release   Apr 16, 2026
1.0.0-alpha2    Pre-release   Apr 16, 2026
1.0.0-alpha1    Pre-release   Apr 15, 2026
1.0.x-dev       Dev           Apr 15, 2026