
ai_document_ocr

2 sites. No security coverage.

AI Document OCR Provider

Google Document AI provider for Drupal's AI module

The AI Document OCR Provider module integrates Google Cloud's powerful Document AI service with Drupal's AI module, enabling automated text extraction from PDF files, images, and other document formats through optical character recognition (OCR).

Key Features

  • Seamless AI Module Integration: Works as a provider plugin for Drupal's AI module ecosystem
  • Google Document AI Powered: Leverages Google Cloud's advanced OCR and document understanding technology
  • Multiple Format Support: Processes PDF, JPEG, PNG, GIF, TIFF, BMP, and WebP files
  • Structured Data Extraction: Extract not just text, but also document structure, paragraphs, and layout information
  • Dynamic Processor Management: Automatically loads available processors from your Google Cloud project
  • Secure Credential Storage: Integrates with Drupal's Key module for secure service account management
  • User-Friendly Configuration: Simple setup with AJAX-powered processor selection
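
To make the structured extraction feature concrete, here is a minimal Python sketch of how paragraph text can be recovered from a Document AI response. It is not taken from this module. It assumes the JSON shape of Google's public Document AI REST response, where the full text sits in `text` and each paragraph points into it through `layout.textAnchor.textSegments`. The function name `paragraph_texts` and the sample data are hypothetical.

```python
from typing import Any


def paragraph_texts(document: dict[str, Any]) -> list[str]:
    """Return the text of each paragraph, in page order.

    Assumes a Document AI style payload: `text` holds the full OCR output
    and each paragraph references slices of it by start and end index.
    """
    full_text = document.get("text", "")
    paragraphs = []
    for page in document.get("pages", []):
        for paragraph in page.get("paragraphs", []):
            anchor = paragraph.get("layout", {}).get("textAnchor", {})
            pieces = []
            for segment in anchor.get("textSegments", []):
                # Indexes arrive as strings in JSON; a zero start index is
                # omitted entirely, so default it to 0.
                start = int(segment.get("startIndex", 0))
                end = int(segment.get("endIndex", 0))
                pieces.append(full_text[start:end])
            paragraphs.append("".join(pieces).strip())
    return paragraphs


# Hypothetical response fragment with two paragraphs.
sample = {
    "text": "Invoice 42\nTotal due: 10 EUR\n",
    "pages": [{"paragraphs": [
        {"layout": {"textAnchor": {"textSegments": [
            {"endIndex": "11"}]}}},
        {"layout": {"textAnchor": {"textSegments": [
            {"startIndex": "11", "endIndex": "29"}]}}},
    ]}],
}

print(paragraph_texts(sample))  # ['Invoice 42', 'Total due: 10 EUR']
```

Keeping the anchors separate from the text is what lets a consumer rebuild paragraphs, lines, or blocks from the same OCR output without running the document through OCR again.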

Perfect For

  • Content Management: Extract text from uploaded documents for indexing and search
  • Form Processing: OCR invoices, forms, and structured documents
  • Digital Archive Processing: Convert scanned documents to searchable text
  • Accessibility Enhancement: Generate text alternatives for image-based content
  • Automated Workflows: Build AI-powered content processing pipelines

Requirements

  • Drupal 10.0+ or 11.0+
  • AI module
  • Google Cloud account with Document AI API access
  • Key module (recommended for secure credential storage)
  • PHP 8.1+

Quick Setup

  1. Enable the module
  2. Create a Google Cloud service account with Document AI access
  3. Store the credentials securely using the Key module
  4. Configure the provider at Configuration > AI > AI Providers > Document OCR
  5. Select your region and processor; the available processors load automatically
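
The region and processor chosen in step 5 determine which Google endpoint receives each document. The Python sketch below shows one way the request could be assembled. It assumes the public Document AI REST conventions (a regional host, a `:process` method, and a `rawDocument` body with base64 content and a MIME type) and is not a description of this module's internals. The helper names, the `MIME_TYPES` table, and the example project, region, and processor IDs are hypothetical.

```python
import base64
import os

# Hypothetical extension table covering the formats listed under Key Features.
MIME_TYPES = {
    ".pdf": "application/pdf",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".png": "image/png",
    ".gif": "image/gif",
    ".tif": "image/tiff",
    ".tiff": "image/tiff",
    ".bmp": "image/bmp",
    ".webp": "image/webp",
}


def processors_url(project_id: str, region: str) -> str:
    """URL that lists the processors of a project in one region."""
    return (
        f"https://{region}-documentai.googleapis.com/v1/"
        f"projects/{project_id}/locations/{region}/processors"
    )


def process_url(project_id: str, region: str, processor_id: str) -> str:
    """URL that sends a document to one specific processor."""
    return f"{processors_url(project_id, region)}/{processor_id}:process"


def process_request_body(filename: str, content: bytes) -> dict:
    """Build the JSON body for a process call from a file name and bytes."""
    extension = os.path.splitext(filename)[1].lower()
    if extension not in MIME_TYPES:
        raise ValueError(f"Unsupported file type: {extension or filename}")
    return {
        "rawDocument": {
            "content": base64.b64encode(content).decode("ascii"),
            "mimeType": MIME_TYPES[extension],
        }
    }


# Example values only; substitute your own project, region, and processor.
print(process_url("my-project", "eu", "abc123"))
print(process_request_body("scan.PNG", b"hi")["rawDocument"]["mimeType"])
```

Rejecting unknown extensions before the request is built keeps unsupported uploads from costing an API call.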

Developer Friendly

Built to modern Drupal standards: clean code, proper dependency injection, and comprehensive documentation. The module follows AI module conventions for easy integration with existing projects.

Transform your document processing workflows with the power of Google's AI technology, seamlessly integrated into your Drupal site.

Activity

Total releases: 3
First release: Aug 2025
Latest release: 7 months ago
Release cadence: 1 day
Stability: 0% stable

Releases

Version       Type         Release date
1.0.0-beta2   Pre-release  Sep 1, 2025
1.0.0-beta1   Pre-release  Sep 1, 2025
1.0.x-dev     Dev          Aug 31, 2025