url_status_scanner
What is the URL Status Scanner module?
The URL Status Scanner module has been developed to allow the tracking of external URLs that are located within the site, to identify broken links, misconfigured or not giving the expected response.
▶ Watch an introduction to URL Status Scanner
Features
For this crawl, the module offers 3 options, scan inside the base code of custom modules and themes; scan inside configuration objects; or also scan inside content entities (such as articles, basic pages, blocks, comments, etc.).
The scan results can be viewed on the dashboard available at the url '/admin/url-status-scanner/dashboard', where you can view information such as the processed URL, response code and its location, that is, for URLs that are within the base code the file path and line number will be indicated; for URLs that are part of the configuration will show the name of the configuration object; and for the URLs found in the content will indicate the type of entity, the name of the bundle, its ID and the name of the field where the processed URL is located.
Additionally, the scan results can be downloaded in CSV or XLSX format for further analysis.
This module is compatible with Drupal multisite projects.
Additional Requirements
This project requires the phpoffice/phpspreadsheet "^4.1" library:
https://packagist.org/packages/phpoffice/phpspreadsheet
How to use URL Status Scanner?
Get the code like any other module.
composer require drupal/url_status_scanner
In the URL 'admin/url-status-scanner/settings' you can select the options in which the URL scanning can be run:
- The "Base code" option will scan URLs in base code files such as modules or custom themes.
- The "Config objects" option will scan URLs in the different configuration objects found on the site.
- The "Content" option will scan URLs in different contents, such as articles, basic pages, blocks, comments, etc. The execution time of this scan depends on the number of content entities and bundles that are found within the site, so that in large sites the scanning can take a long time.