Drupal is a registered trademark of Dries Buytaert

document_loader_pdfparser

2 sites No security coverage
View on drupal.org

Enables converting PDFs to Text through a Document Loader plugin via smalot/pdfparser. It enables Drupal modules to register and use PDF parsing in their document processing workflows.

Features

  • Minimal dependencies using straight PHP, without any additional web service requirements
  • Retrieve MetaData from the PDF

Available Inputs

  • PdfInput — PDF document from a File URI

Available Outputs

  • TextOutput — Plain text content

Post-Installation

Visit the Document Loader configuration page to see PDF Parser available.

Additional Requirements

Install with Composer to ensure you have all the required dependencies.

None.

Similar projects

Activity

Total releases
2
First release
Feb 2026
Latest release
2 weeks ago
Release cadence
0 days
Stability
50% stable

Releases

Version Type Release date
1.0.0 Stable Feb 12, 2026
1.0.x-dev Dev Feb 12, 2026