Uploaded image for project: 'Sakai'
  1. Sakai
  2. SAK-45331

Update Apache Tika 1.26

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: RESOLVED
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 20.2
    • Fix Version/s: 22.0 [Tentative]
    • Component/s: Master
    • Labels:
      None
    • 21 Status:
      Please Merge
    • 20 status:
      Please Merge
    • Test Plan:

      Description

      Release 1.26 - 03/24/2021

      • Fix thread safety bug in OpenOffice parser (TIKA-3334).
      • The "writeLimit" header now pertains to the combined characters
        written per container document (and embedded documents) in the /rmeta
        endpoint in tika-server (TIKA-3325); it no longer functions only
        per container or embedded document.
      • Extract more embedded files in PDFs by recursively processing the
        embedded file tree (TIKA-3332).
      • Allow for case insensitive headers for configuration of the PDFParser
        and the TesseractOCRParser in tika-server via Subhajit Das (TIKA-3320).
      • Improve detection and parsing of XPS files (TIKA-3316).
      • General dependency upgrades (TIKA-3244).
      • Great optimization in ForkParser (TIKA-3237).
      • Fix parsing of emails attached to other emails in PST files (TIKA-3004).
      • MP3 parser should output the xmpDM:duration metadata as seconds not
        milliseconds, consistent with the other Audio and Video parsers (TIKA-3318).
      • MP4 parser check if any of the Compatible Brands match when identifying
        the subtype (TIKA-3310).

        Gliffy Diagrams

          Zeplin

            Attachments

              Activity

                People

                Assignee:
                dhorwitz David Horwitz
                Reporter:
                dhorwitz David Horwitz
                Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                  Dates

                  Created:
                  Updated:
                  Resolved:

                    Git Integration