
Extracts text from PDF files through a Document Loader plugin backed by the PDF Parser PHP library. Other Drupal modules can register and use PDF parsing in their document processing workflows.

Features

  • Extracts text from PDFs for use through Document Loader
  • Minimal dependencies: pure PHP, with no external web service required
  • Retrieves metadata from the PDF (page count, author, etc.)
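
The features above can be illustrated with a minimal sketch of plain-PHP extraction. It assumes that the "PDF Parser PHP library" is smalot/pdfparser, which this page does not confirm, and it calls that library directly rather than going through the module's plugin. The function name example_extract_pdf() is hypothetical and is not part of the module.

```php
<?php

declare(strict_types=1);

use Smalot\PdfParser\Parser;

/**
 * Hypothetical helper, not part of the module: extracts text and metadata.
 *
 * Assumes smalot/pdfparser is installed and Composer's autoloader is already
 * loaded, as it is inside a bootstrapped Drupal site.
 */
function example_extract_pdf(string $path): array {
  if (!class_exists(Parser::class)) {
    throw new RuntimeException('smalot/pdfparser is not installed.');
  }

  $parser = new Parser();
  // parseFile() reads the PDF from disk and returns a Document object.
  $document = $parser->parseFile($path);

  return [
    // Plain text of all pages, concatenated.
    'text' => $document->getText(),
    // Metadata array; which keys exist (for example an author) depends on the PDF.
    'details' => $document->getDetails(),
    // Page count derived from the parsed page objects.
    'page_count' => count($document->getPages()),
  ];
}
```

Called as example_extract_pdf('/path/to/file.pdf'), this would return the text together with whatever metadata the PDF carries. In practice the module exposes this capability through Document Loader, so calling the library directly should only be needed when experimenting.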

Available Inputs

  • PdfInput — PDF document from a File URI

Available Outputs

  • TextOutput — Plain text content

Post-Installation

Visit the Document Loader configuration page to confirm that PDF Parser is listed as available.

Additional Requirements

Install with Composer to ensure you have all the required dependencies:

composer require drupal/document_loader_pdfparser

Activity

Total releases: 3
First release: Feb 2026
Latest release: 2 months ago
Release cadence: 14 days
Stability: 67% stable

Releases

Version     Type    Release date
1.1.0       Stable  Mar 11, 2026
1.0.0       Stable  Feb 12, 2026
1.0.x-dev   Dev     Feb 12, 2026