tika-python icon indicating copy to clipboard operation
tika-python copied to clipboard

Have `.from_file()` methods accept file objects as input

Open edwin-lai opened this issue 5 years ago • 4 comments

Currently the .from_file() methods only accept urls and filepaths as input. I would like it to accept file objects as well so applications that have already opened the files don't have to write the file again for Tika to consume. I am willing submit a pull request that implements this feature.

edwin-lai avatar Dec 13 '19 21:12 edwin-lai

thanks @edwin-lai i'll watch for the PR!

chrismattmann avatar Dec 13 '19 21:12 chrismattmann

@edwin-lai Can you please make sure that when you accept an open file stream from an outside source, you leave the stream open? Tika's auto-close behavior is unexpected.

nycki93 avatar Feb 18 '20 23:02 nycki93

Hey,

Is this implemented already, i have file as byte code and i want to extract content from that without writing that byte content to a file, can we achieve that in pyhton?

Thanks Deepak Jindal [email protected] 7838068727

deepakjindal90 avatar Jul 16 '20 22:07 deepakjindal90

not to my knowledge, no @deepakjindal90

chrismattmann avatar Jul 21 '20 01:07 chrismattmann

fixed in #316

chrismattmann avatar Dec 31 '22 21:12 chrismattmann