Uploaded image for project: 'Sakai'
  1. Sakai
  2. SAK-44742

Update Apache Tika 1.25

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: RESOLVED
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 20.1
    • Fix Version/s: 22.0 [Tentative]
    • Component/s: Master, Search
    • Labels:
      None
    • Test Plan:

      Description

      Release 1.25 - 11/25/2020

      • Fix inconsistent license in xmpcore (TIKA-3204).
      • General upgrades including some dependencies with
        recently found security vulnerabilities (TIKA-3119).
      • Add detection and a parser for flat ODF files (TIKA-3159).
      • Add extraction of macros from ODF files (TIKA-3161).
      • Add mime detection for hprof and hprof text files (TIKA-3144).
      • Add TextSignature and TextProfileSignature to tika-eval (TIKA-3145 and TIKA-3146)
      • Create a metadata filter to trigger tika-eval stats post parsing (TIKA-3140)
      • Add a configurable metadata-filter for the RecursiveParserWrapper (TIKA-3137)
      • Add status endpoint to tika-server (TIKA-3129).
      • Remove whitelist/blacklist terminology (TIKA-3120)
      • Add detection for parquet files (TIKA-3115).
      • Add detection and parsing for bplist (TIKA-3104).
      • Enable metadata value filtering for RecursiveParserWrapper (TIKA-3137)
      • Add a basic parser for plist files based on com.googlecode.plist:dd-plist (TIKA-3104).
      • Read hyperlinked images from ODT files (TIKA-3156).
      • Updated GrobidRESTParser to use new API location (TIKA-3191).
      • Add FileProfiler to tika-eval (TIKA-3216).
      • Add status endpoint to tika-server (TIKA-3129).
      • Improved handling of zip files with STORED entries with
        data descriptor (TIKA-3196).
      • Add parsers for XLZ, IDML and MIF (TIKA-2976, TIKA-3188 and TIKA-3189).
      • Add the beginnings of a format-aware fuzzing module (TIKA-3083).
      • Add wrapper for Linux 'file' command for mime detection (TIKA-3215).
      • Added ability to skip parsing of embedded files in Tika Server (TIKA-3227).

        Gliffy Diagrams

          Zeplin

            Attachments

              Activity

                People

                Assignee:
                dhorwitz David Horwitz
                Reporter:
                dhorwitz David Horwitz
                Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                  Dates

                  Created:
                  Updated:
                  Resolved:

                    Git Integration