Draft: Add rudimentary Excel (.xlsx) support
Hi all, first contribution to Debian.
This is not a complete solution yet, because it currently requires tika-app.x.y.z.jar
from the Apache Tika project and a helper script. There are also no tests.
There is a Tika package in Debian as libtika-java, but it doesn't include the -app JAR file. Is my first step to work with the maintainer(s) of said package to try and get this JAR file included in Debian, or is there a way to get diffoscope to download the JAR and helper script?
Yes, I have an adverse reaction to the second option, too. The helper script doesn't even matter, really, that could just be inlined as a subprocess.check_call or whatever, no problem. But getting the .jar is something I'd appreciate advice on how to handle, even if technically I could come up with some kludgy ways to do it.
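
For reference, here is a rough sketch of what inlining the helper script might look like. It is not what this draft currently does. The JAR path and the function names are placeholders I made up for illustration, and it uses subprocess.check_output rather than check_call so the extracted text can be captured. The only Tika-specific assumption is tika-app's --text option for plain-text output.

```python
import subprocess

# Placeholder path: the version is left as x.y.z, and where the JAR would
# actually live is exactly the open question in this MR.
TIKA_APP_JAR = "/path/to/tika-app.x.y.z.jar"


def build_tika_command(xlsx_path, jar_path=TIKA_APP_JAR):
    """Return the argv list asking tika-app to dump a file as plain text."""
    return ["java", "-jar", jar_path, "--text", xlsx_path]


def xlsx_to_text(xlsx_path, jar_path=TIKA_APP_JAR):
    """Run tika-app on the spreadsheet and return its plain-text output."""
    # Like check_call, check_output raises CalledProcessError on a non-zero
    # exit status; it additionally returns the captured stdout.
    return subprocess.check_output(
        build_tika_command(xlsx_path, jar_path), text=True
    )
```

Calling xlsx_to_text("some.xlsx") needs a Java runtime and the JAR at the given path, so this only removes the helper script, not the JAR dependency.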