node-tika icon indicating copy to clipboard operation
node-tika copied to clipboard

Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more.

Results 12 node-tika issues
Sort by recently updated
recently updated
newest added

[![mitm_build](https://user-images.githubusercontent.com/1323708/59226671-90645200-8ba1-11e9-8ab3-39292bef99e9.jpeg)](https://infosecwriteups.com/want-to-take-over-the-java-ecosystem-all-you-need-is-a-mitm-1fc329d898fb) --- This is a security fix for a high severity vulnerability in your [Apache Maven](https://maven.apache.org/) `pom.xml` file(s). The build files indicate that this project is resolving dependencies over HTTP...

while installing the tika, I am getting node gyp rebuild error which is strange. Changed the configs for python and changed MVS version to 2013 even than unable to install...

[![mitm_build](https://user-images.githubusercontent.com/1323708/59226671-90645200-8ba1-11e9-8ab3-39292bef99e9.jpeg)](https://medium.com/@jonathan.leitschuh/want-to-take-over-the-java-ecosystem-all-you-need-is-a-mitm-1fc329d898fb?source=friends_link&sk=3c99970c55a899ad9ef41f126efcde0e) - [Want to take over the Java ecosystem? All you need is a MITM!](https://medium.com/@jonathan.leitschuh/want-to-take-over-the-java-ecosystem-all-you-need-is-a-mitm-1fc329d898fb?source=friends_link&sk=3c99970c55a899ad9ef41f126efcde0e) - [Update: Want to take over the Java ecosystem? All you need is a MITM!](https://medium.com/bugbountywriteup/update-want-to-take-over-the-java-ecosystem-all-you-need-is-a-mitm-d069d253fe23?source=friends_link&sk=8c8e52a7d57b98d0b7e541665688b454)...

![image](https://cloud.githubusercontent.com/assets/10927/24757836/5e76349a-1aae-11e7-8ea0-abe98432ccb9.png) I am on Mac OS X Sierra. I tried using the "jabba" jvm version manager so I could map to 1.7 before building node-tika, but got errors. Now the...

- Updated `tika` to v1.17. - Updated `node-java` to v0.9.0. - Increased timeout for FTP test and updated link to test file. - Updated encrypted docx test to reflect new...

The current version of node-java doesn't compile on the lastest LTS release of Node. Upgrading to 0.11.0 makes node-tika work on Node 8.12.0 as well as 10.13.0.

The default `User-Agent` HTTP header of `java.net.URLConnection` is `Java/1.8_162`. Some hosting providers filter requests with such a `User-Agent`, e.g. Cloudflare. Below is an example exception. ``` { Error: Error running...

If i want to use the newest tika version do i just have to put another jar in the folder or is there more work to do?

Our scenario is to get .pdf files uploaded in AWS S3 storage and process it later. We want to move to AWS Lambda. However, Lambda requires that the entire package...

Hi there- I'd like to use the BoilerPipeContentHandler to only extract body text from an HTML page. Can anyone suggest a way to make this happen. I don't know much...