Metadata and content analysis service

extractor / stable

This extension detects and extracts metadata (EXIF / IPTC / XMP / ...) from potentially thousands of different file types (such as MS Word/PowerPoint/Excel documents, PDFs and images) and brings them automatically and natively into TYPO3 when assets are uploaded. It works with built-in PHP functions but takes advantage of Apache Tika and other external tools for enhanced metadata extraction.

This version supports TYPO3

Older versions also support TYPO3

Composer support

composer req causal/extractor

Tags

#metadata #iptc #exif #xmp #upload #fal #tika

Last upload comment

This is mainly a bug fix release based on problems reported in the bug tracker.

The version is raised to 2.5.0 because we explicitly (and hopefully temporarily) disabled the mapping configuration tool in Extension Settings, which actually only works prior to TYPO3 v11. The reason is that custom JS is no longer interpreted. We hope to find a way around this, but instead of being "broken", the tool is now disabled.

The other reason is that this release brings early support for TYPO3 v13 while keeping backward compatibility with former LTS versions. As part of that work, we spotted that the file metadata field "copyright" has been enlarged. We replicated this change in the database schema we provide, so a DB update is suggested (although not absolutely necessary) for TYPO3 installations prior to TYPO3 v13.

Author: Xavier Perseguers
Company: Causal Sàrl
Last update: 03. Mar 2024
First upload: 15. Mar 2014
Downloads: 113,906
Category: Services
Dependencies

Installation

Via ZIP file:

  1. Download ZIP file
  2. Log into your TYPO3 backend
  3. Go to Extension Manager module
  4. Press the upload button on the top bar
  5. Select the ZIP file and upload it. If you want to overwrite an existing extension installation, activate the checkbox.

Via Composer:

  1. Go to your folder where the root composer.json file is located
  2. Type: composer req causal/extractor to get the latest version that runs on your TYPO3 version.

Via T3X file:

  1. Download T3X file
  2. Log into your TYPO3 backend
  3. Go to Extension Manager module
  4. Press the upload button on the top bar
  5. Select the T3X file and upload it. If you want to overwrite an existing extension installation, activate the checkbox.

Version history

Version Upload comment Works with TYPO3 Download
2.5.0 / stable
March 03, 2024
This is mainly a bug fix release based on problems reported in the bug tracker.

The version is raised to 2.5.0 because we explicitly (and hopefully temporarily) disabled the mapping configuration tool in Extension Settings, which actually only works prior to TYPO3 v11. The reason is that custom JS is no longer interpreted. We hope to find a way around this, but instead of being "broken", the tool is now disabled.

The other reason is that this release brings early support for TYPO3 v13 while keeping backward compatibility with former LTS versions. As part of that work, we spotted that the file metadata field "copyright" has been enlarged. We replicated this change in the database schema we provide, so a DB update is suggested (although not absolutely necessary) for TYPO3 installations prior to TYPO3 v13.
8.7.0 - 13.0.99 Download
2.4.4 / stable
January 31, 2024
Standard maintenance release.
8.7.0 - 12.4.99 Download
2.4.3 / stable
November 26, 2023
Since TYPO3 v12 (and v11 if optionally using typo3/cms-composer-installers v4), all extensions are located within vendor/ directory.
This bugfix release addresses this change and allows file metadata's TCA to be properly overridden for images, thus pleasing all the photographers among us with many useful additional camera-related fields such as make, model, lens, shutter speed and so on...
8.7.0 - 12.4.99 Download
2.4.2 / stable
November 19, 2023
Various bug fixes.
8.7.0 - 12.4.99 Download
2.4.1 / stable
November 07, 2023
Compatibility with TYPO3 v12, including the getid3 library! (for those of you still not using composer).
8.7.0 - 12.4.99 Download
2.4.0 / stable
November 07, 2023
Compatibility with TYPO3 v12.
8.7.0 - 12.4.99 Download
2.3.1 / stable
November 03, 2023
Bugfix release to include the missing getid3 library (not a problem with composer-based install).
8.7.0 - 11.5.99 Download
2.3.0 / stable
May 10, 2023
Standard maintenance release.
8.7.0 - 11.5.99 Download
2.2.0 / stable
April 20, 2022
Standard maintenance release.
8.7.0 - 11.5.99 Download
2.1.0 / stable
January 22, 2021
Feature release that allows TYPO3 categories to be dynamically assigned to the corresponding file when using the signal/slot or event/listener scheme.
8.7.0 - 11.5.99 Download
2.0.0 / stable
January 20, 2021
- Compatibility with TYPO3 v11
- BEWARE: The Signal/Slot after metadata extraction has vanished and is now a PSR-14 event in TYPO3 v10. Check the documentation (Developer Manual) if needed.
8.7.0 - 11.5.99 Download
1.7.3 / stable
August 03, 2020
This is a bug fix release. For some users, the extraction service was not properly executed when uploading a file. This was related to a workaround we implemented for a design flaw in FAL, which re-extracts metadata when files are moved around.
8.7.0 - 10.4.99 Download
1.7.2 / stable
April 28, 2020
Bug fix release for a compatibility issue in TYPO3 v8 and v9.
8.7.0 - 10.4.99 Download
1.7.1 / stable
April 23, 2020
This release mitigates a design flaw in TYPO3 FAL present since TYPO3 6.2 LTS: when a file is moved within fileadmin, FAL reindexes it afterwards, extracts the metadata from the physical file again and replaces them in the database, thus effectively leading to a loss of information if a user manually edited the metadata. The problem has been reported but is not yet fixed, and it will not be fixed for TYPO3 v8 anyway (possibly with the ELTS plan? Not even sure). So for v8 users, and as long as it is not fixed in Core, this extension will simply "refuse" to extract metadata if they are already present for the file.

In addition, a bug has been fixed where some PDFs whose metadata were encoded as UTF-16BE in octal representation would lead to an extracted title and other fields like "\376\377\000D\000o\000c...", which is absolutely useless!

Finally a deprecated call was still present in TYPO3 v8 and is now migrated.
8.7.0 - 10.4.99 Download
1.7.0 / stable
April 20, 2020
Long-awaited release with support for TYPO3 v8, TYPO3 v9 and TYPO3 v10 so that it eases your migration path between LTS versions.

Now with enhanced support for OPCache when not using composer, fixed support for encrypted PDF and last but not least, a better icon since this extension is about extracting and analyzing metadata and not "searching" for files.
8.7.0 - 10.4.99 Download
1.6.0 / stable
May 15, 2018
Official compatibility with PHP 7.2.
7.6.0 - 8.7.99 Download
1.5.1 / stable
June 01, 2017
Regular bug fix release.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_151
6.2.0 - 8.7.99 Download
1.5.0 / stable
April 25, 2017
Map IPTC categories to TYPO3 system categories with PHP native extraction.
TYPO3 logger lets you follow the metadata extraction process.
Various bug fixes.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_150
6.2.0 - 8.7.99 Download
1.4.0 / stable
November 03, 2016
Map IPTC categories to TYPO3 system categories.
Fix crash when using PHP 7.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_140
6.2.0 - 8.99.99 Download
1.3.2 / stable
July 14, 2016
Regular bug fix release for TYPO3 v8.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_132
6.2.0 - 8.99.99 Download
1.3.1 / stable
March 30, 2016
Compatibility with TYPO3 v8.
Update getID3 library.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_131
6.2.0 - 8.99.99 Download
1.3.0 / stable
February 08, 2016
Handy tool to configure your own mapping configuration.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_130
6.2.0 - 7.99.99 Download
1.2.2 / stable
January 15, 2016
Fix missing categories tab for images.
Fix "auto_extract" configuration in TYPO3 6.2.
Update getID3 library.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_122
6.2.0 - 7.99.99 Download
1.2.1 / stable
December 03, 2015
Extraction could fail to be triggered in TYPO3 7.
Better handling of PDF metadata using native PHP extraction.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_121
6.2.0 - 7.99.99 Download
1.2.0 / stable
November 09, 2015
Loads of additional file types analysed in native PHP.
Takes EXT:media changes to TCA into account. Use of EXT:metadata is however discouraged (and useless anyway).
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_120
6.2.0 - 7.99.99 Download
1.1.0 / stable
October 26, 2015
Streamlined metadata user interface in 6.2 as in 7.
Enhanced autonomous extraction with PHP (docx, xlsx, pdf, jpg, ...).
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_110
6.2.0 - 7.99.99 Download
1.0.0 / stable
October 19, 2015
Extension is not acting as a bridge anymore since the former dependency extensions lack any plausible sign of life.
Details on https://forge.typo3.org/projects/extension-extractor/wiki/EXTRACTOR_100
6.2.0 - 7.99.99 Download
0.2.0 / beta
March 16, 2014
Support for Apache Tika. Configuration option to extract metadata on-the-fly during upload (no need for FAL scheduler task).
6.2.0 - 6.2.99 Download
0.1.0 / alpha
March 15, 2014
Initial release to gather feedback.
6.2.0 - 6.2.99 Download
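
The 1.7.1 upload comment above mentions PDF metadata stored as UTF-16BE in octal representation. The following is only an illustrative sketch in Python of how such a value can be turned back into readable text; it is not the extension's actual code. The function name `decode_pdf_text` is hypothetical, and the Latin-1 fallback for values without a byte order mark is an assumption made for simplicity.

```python
import re

# Matches a backslash followed by one to three octal digits, e.g. \376 or \000.
_OCTAL_ESCAPE = re.compile(rb"\\([0-7]{1,3})")

# Byte order mark that announces UTF-16 big-endian text.
UTF16_BE_BOM = b"\xfe\xff"


def decode_pdf_text(raw: str) -> str:
    """Decode a value that may contain octal escapes and a UTF-16BE BOM.

    Hypothetical helper for illustration; assumes `raw` only contains
    characters in the Latin-1 range.
    """
    # Work on bytes: each character of the raw value maps to one byte.
    data = raw.encode("latin-1")
    # Replace every octal escape with the single byte it stands for.
    data = _OCTAL_ESCAPE.sub(
        lambda m: bytes([int(m.group(1), 8) & 0xFF]), data
    )
    if data.startswith(UTF16_BE_BOM):
        # Skip the two-byte BOM, then decode the rest as UTF-16 big-endian.
        return data[2:].decode("utf-16-be", errors="replace")
    # No BOM: fall back to Latin-1 (an assumption, not a full PDF decoder).
    return data.decode("latin-1")


if __name__ == "__main__":
    # Visible prefix of the example quoted in the 1.7.1 upload comment.
    print(decode_pdf_text(r"\376\377\000D\000o\000c"))  # prints "Doc"
```

The sketch only handles octal escapes; other backslash escapes that can appear in PDF strings are left untouched.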