Skip to content

Draft: Add rudimentary Excel (.xlsx) support

Hi all, first contribution to Debian.

This is not complete solution yet, because it currently requires tika-app.x.y.z.jar from the Apache Tika project1 and a helper script. There are also no tests.

There is a Tika package in Debian as libtika-java, but it doesn't include the -app JAR file. Is my first step to work with the maintainer(s) of said package to try and get this JAR file included in Debian, or is there a way to get diffoscope to download the JAR and helper script?

Yes, I have an adverse reaction to the second option, too. The helper script doesn't even matter, really, that could just be inlined as a subprocess.check_call or whatever, no problem. But getting the .jar is something I'd appreciate advice on how to handle, even if technically I could come up with some kludgy ways to do it.

  1. https://tika.apache.org/download.html

Edited by Kevin Ernst

Merge request reports

Loading