Drupal is a registered trademark of Dries Buytaert
drupal 10.5.11 Update released for Drupal core (10.5.11)! drupal 11.3.11 Update released for Drupal core (11.3.11)! drupal 11.2.13 Update released for Drupal core (11.2.13)! drupal 10.6.10 Update released for Drupal core (10.6.10)! cms 2.1.2 Update released for Drupal core (2.1.2)! drupal 11.1.10 Update released for Drupal core (11.1.10)! drupal 10.5.10 Update released for Drupal core (10.5.10)! drupal 10.4.10 Update released for Drupal core (10.4.10)! drupal 11.2.12 Update released for Drupal core (11.2.12)! drupal 11.3.10 Update released for Drupal core (11.3.10)! drupal 10.6.9 Update released for Drupal core (10.6.9)! drupal 10.6.8 Update released for Drupal core (10.6.8)! drupal 11.3.9 Update released for Drupal core (11.3.9)! drupal 11.3.8 Update released for Drupal core (11.3.8)! drupal 11.3.7 Update released for Drupal core (11.3.7)! drupal 11.2.11 Update released for Drupal core (11.2.11)! drupal 10.6.7 Update released for Drupal core (10.6.7)! drupal 10.5.9 Update released for Drupal core (10.5.9)! cms 2.1.1 Update released for Drupal core (2.1.1)! drupal 11.3.6 Update released for Drupal core (11.3.6)!

This module allows extracting content from Word and RTF documents for use with Document Loader, using the phpoffice/phpword PHP library.

Supported Input Formats:

  • Word 2007+ (.docx)
  • Word 2003 (.doc)
  • OpenDocument Text (.odt)
  • Rich Text Format (.rtf)

Supported Output Formats:

  • text
  • html
  • markdown

Note on RTF: RTF support is best-effort as PHPWord's RTF reader has limitations. It does not preserve headings or lists, and may drop special characters like smart quotes, accented letters, and dashes.

Requirements

This module requires the following modules:

Installation

composer require drupal/document_loader_phpword

Configuration

  1. Enable the module at Administration > Extend
  2. See PHPWord as an available plugin in the Document Loader configuration at admin/config/media/document-loader

Similar Projects

  • AI File To Text: Leverages the AI module to improve the output of loaded documents

Activity

Total releases
2
First release
May 2026
Latest release
19 hours ago
Release cadence
0 days
Stability
0% stable

Releases

Version Type Release date
1.0.0-alpha1 Pre-release May 29, 2026
1.0.x-dev Dev May 29, 2026