Drupal is a registered trademark of Dries Buytaert

ai_document_ocr

1 sites No security coverage
View on drupal.org

AI Document OCR Provider

Google Document AI provider for Drupal's AI module

The AI Document OCR Provider module integrates Google Cloud's powerful Document AI service with Drupal's AI module, enabling automated text extraction from PDF files, images, and other document formats through optical character recognition (OCR).

Key Features

  • Seamless AI Module Integration: Works as a provider plugin for Drupal's AI module ecosystem
  • Google Document AI Powered: Leverages Google Cloud's advanced OCR and document understanding technology
  • Multiple Format Support: Process PDFs, JPEG, PNG, GIF, TIFF, BMP, and WebP files
  • Structured Data Extraction: Extract not just text, but also document structure, paragraphs, and layout information
  • Dynamic Processor Management: Automatically loads available processors from your Google Cloud project
  • Secure Credential Storage: Integrates with Drupal's Key module for secure service account management
  • User-Friendly Configuration: Simple setup with AJAX-powered processor selection

Perfect For

  • Content Management: Extract text from uploaded documents for indexing and search
  • Form Processing: OCR invoices, forms, and structured documents
  • Digital Archive Processing: Convert scanned documents to searchable text
  • Accessibility Enhancement: Generate text alternatives for image-based content
  • Automated Workflows: Build AI-powered content processing pipelines

Requirements

  • Drupal 10.0+ or 11.0+
  • AI module
  • Google Cloud account with Document AI API access
  • Key module (recommended for secure credential storage)
  • PHP 8.1+

Quick Setup

  1. Enable the module
  2. Create Google Cloud service account with Document AI access
  3. Store credentials securely using Key module
  4. Configure the provider at Configuration > AI > AI Providers > Document OCR
  5. Select your region and processor - available processors load automatically!

Developer Friendly

Built with modern Drupal standards - clean code, proper dependency injection, comprehensive documentation, and follows AI module conventions for easy integration with existing projects.

Transform your document processing workflows with the power of Google's AI technology, seamlessly integrated into your Drupal site.

Activity

Total releases
3
First release
Aug 2025
Latest release
6 months ago
Release cadence
1 day
Stability
0% stable

Release Timeline

Releases

Version Type Release date
1.0.0-beta2 Pre-release Sep 1, 2025
1.0.0-beta1 Pre-release Sep 1, 2025
1.0.x-dev Dev Aug 31, 2025