Skip to content

Bump tika-core from 2.4.0 to 2.4.1

Bumps tika-core from 2.4.0 to 2.4.1.

Changelog

Sourced from tika-core's changelog.

Release 2.4.2 - ???

  • Add unrar as an optional parser (TIKA-3800).

  • Refactor FuzzingCLI to use PipesParser (TIKA-3799).

  • ServiceLoader's loadServiceProviders() now guarantees unique classes (TIKA-3797).

  • Fix bug that prevented setting of includeHeadersAndFooters for xls, xlsx, doc and docx via tika-config (TIKA-3796).

  • Fix bug that prevented specification of rendered image type via http header in the PDFParser (TIKA-3794).

  • Fix bug causing some Exif dates to be decoded wrongly on timezones different than UTC (TIKA-3815).

Release 2.4.1 - 06/14/2022

  • Implement bulk upload in the OpenSearch emitter (TIKA-3791).

  • Implement tika-server client via pipes mode (TIKA-3790).

  • Custom embedded parsers and EmbeddedDocumentHandlers can now add metadata to the container file's metadata (TIKA-3789).

  • Record embedded file exceptions in the container file's metadata (TIKA-3788).

  • Allow continuation of parsing after write limit has been reached (TIKA-3787).

  • Allow pass-through of 'Content-Length' header to metadata in TikaResource (TIKA-3786).

  • Add embedded depth to profiles tables in tika-eval (TIKA-3775).

  • Add stop() method to TikaServerCli so that it can be run with Apache Commons Daemon (TIKA-1570).

  • Fixed bug in ordering of Parsers during service loading (TIKA-3750).

  • Users can expand system properties from the forking process into forked tika-server processes (TIKA-3748).

  • Fix a few files being wrongly detected as EML (TIKA-3771).

... (truncated)

Commits
  • aa3bfef [maven-release-plugin] prepare release 2.4.1-rc1
  • 98e9cf8 prep for 2.4.1 rc1
  • 0ea6571 TIKA-3792 -- only apply the handler decorator once for legacy xhtml processin...
  • e1892af TIKA-3790 -- fix unit test. sorry.
  • 47de04f TIKA-3779 -- make sure to close the temp stream in PDFParser and clean up aft...
  • 7877f9b TIKA-3790 -- actually implement tika server client via pipes (not yet async)
  • 18ce798 TIKA-3791 -- implement bulk updates in OpenSearch emitter.
  • ac6fe5b TIKA-3751: remove more of netty, update zookeeper to latest
  • 330dff3 TIKA-3751: remove netty, update zookeeper somewhat
  • fae27ea TIKA-3751: Update netty
  • Additional commits viewable in compare view


Dependabot commands
You can trigger Dependabot actions by commenting on this MR
  • $dependabot rebase will rebase this MR
  • $dependabot recreate will recreate this MR rewriting all the manual changes and resolving conflicts

Merge request reports