Skip to content

Bump tika-core from 2.5.0 to 2.6.0

Bumps tika-core from 2.5.0 to 2.6.0.

Changelog

Sourced from tika-core's changelog.

Release 2.6.0 - 11/3/2022

  • Add optional Siegfried detector (TIKA-3901).

  • Move OverrideDetector's functionality to the CompositeDetector (TIKA-3904).

  • The FileCommandDetector has been refactored to have the same behavior as the Siegfried detector; see setUseMime in the javadoc (TIKA-3902).

  • Fix bug in OpenSearch emitter that prevented upserts on documents with embedded files (TIKA-3882).

  • Extract PDF actions and triggers into the file's metadata (TIKA-3887).

  • Add a tika-async-cli module (TIKA-3885).

  • Fetch keys sent via headers to tika server are now URL decoded (TIKA-3864).

Release 2.5.0 - 09/30/2022

  • Improved extraction of PDF subset info for PDF/UA, PDF/VT, and PDF/X. NOTE: we no longer append PDF/A information, e.g. 'version="A-1b"' to the 'dc:format'. Users must now get that information from the 'pdfa:PDFVersion' key or from 'pdfaid:conformance' and 'pdfaid:part' (TIKA-3844).

  • Avoid infinite loop in bookmark extraction from PDFs (TIKA-3832).

  • Upgraded to slf4j 2.0.1 (TIKA-3842).

  • Added upsert option for the OpenSearch emitter (TIKA-3855).

  • Extract PDF signature information at the document level into the metadata (TIKA-3852).

  • Enable configuration of digests via AutoDetectParserConfig (TIKA-3853).

  • Use commons-io byte array streams via PJ Fanning (TIKA-3843).

  • Upgrade to PDFBox 2.0.27 (TIKA-3866).

  • Upgrade to JempBox 1.8.17 (TIKA-3856).

  • Add extraction of ODF version from ODF files (TIKA-3840).

  • tika-parser-html-commons (BoilerPipeHandler) is no longer a a dependency of tika-parser-html-module. tika-app and tika-server-standard have added a dependency on tika-parser-html-commons. However, users who are managing custom dependencies and who want the BoilerPipeHandler

... (truncated)

Commits
  • 41319f3 [maven-release-plugin] prepare release 2.6.0-rc1
  • aec8029 Binary incompatibility with updated maven release plugin, try to update scm a...
  • d9040f4 Merge remote-tracking branch 'origin/main'
  • 89f0821 add release date, fix rat problems, update 2.5.1 -> 2.6.0 for next release cycle
  • 9911dd9 Merge pull request #784 from apache/dependabot/maven/org.apache.maven.plugins...
  • 71d6aca Merge pull request #783 from apache/dependabot/maven/aws.version-1.12.334
  • f6d80df Bump maven-release-plugin from 3.0.0-M6 to 3.0.0-M7
  • 02bd6f7 Bump aws.version from 1.12.333 to 1.12.334
  • dfc99d6 Merge pull request #782 from apache/dependabot/maven/aws.version-1.12.333
  • 1a0d6ed Bump aws.version from 1.12.332 to 1.12.333
  • Additional commits viewable in compare view


Dependabot commands
You can trigger Dependabot actions by commenting on this MR
  • $dependabot rebase will rebase this MR
  • $dependabot recreate will recreate this MR rewriting all the manual changes and resolving conflicts

Merge request reports

Loading