admin管理员组文章数量:1022982
I'm running Apache Tika Server in a docker container and trying to extract the text from PDFs contained in a password protected ZIP file.
I've tried passing the password in the HTTP header as 'Password' and 'X-Tika-Password', however all it does is list the files in the ZIP folder without extracting the text.
If I remove the password from the ZIP file then it extracts the text from the PDFs perfectly.
I've tried this:
curl --location --request PUT '127.0.0.1:9998/tika' \
--header 'Accept: text/plain' \
--header 'Password: 123456' \
--header 'Content-Type: application/zip' \
--data-binary '@file/path/to.zip'
And just get back plain text with:
Name Of First File.pdf
Name of Second FIle.pdf
I'm running Apache Tika Server in a docker container and trying to extract the text from PDFs contained in a password protected ZIP file.
I've tried passing the password in the HTTP header as 'Password' and 'X-Tika-Password', however all it does is list the files in the ZIP folder without extracting the text.
If I remove the password from the ZIP file then it extracts the text from the PDFs perfectly.
I've tried this:
curl --location --request PUT '127.0.0.1:9998/tika' \
--header 'Accept: text/plain' \
--header 'Password: 123456' \
--header 'Content-Type: application/zip' \
--data-binary '@file/path/to.zip'
And just get back plain text with:
Name Of First File.pdf
Name of Second FIle.pdf
本文标签: How to use Apache Tika Server with password protected filesStack Overflow
版权声明:本文标题:How to use Apache Tika Server with password protected files? - Stack Overflow 内容由热心网友自发贡献,该文观点仅代表作者本人, 转载请联系作者并注明出处:http://it.en369.cn/questions/1745594512a2158083.html, 本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容,一经查实,本站将立刻删除。
发表评论