Tim Allison

93 comments by Tim Allison

If you want to fix the conflicts, that'd be great...or, this is small enough, I can take it. Let me know. Thank you!

As a side note, we've locked `master` for archival use only now. We're using `main` as the dev branch for Tika 2.0.0, and `branch_1x` as the dev branch for Tika 1.25,...

Inertia... I never got around to doing a bakeoff between the two, and, unless there's evidence of improvement, I'm hesitant to make the change as the default detector.

Wait, it turns out I did get around to doing this study... https://github.com/tballison/share/blob/main/slides/Tika_charset_detector_study_201909.docx Let me read it and remember what I found... :rofl:

@tothd91 thank you for opening this! It looks like there are quite a few changes that are white-space only. Would it be possible to update so that the diff includes...

> are you kidding me? you're awesome! all looks great to me. +1 Thank you!

> should we use TestContainers to test this within a docker container to make sure it works? or is it sufficient to just run test only if dwgread is installed?...

> @nddipiazza @tballison This looks messy, can you advise a way to clean it up? A better way of doing it? Still think it's worth having the comments there? OMG,...

> > @nddipiazza @tballison This looks messy, can you advise a way to clean it up? A better way of doing it? Still think it's worth having the comments there?...

> If I use a buffered reader I get the correct output but it's slower: 3s vs 10s (it's quite a large file) > > ``` > > > > //FileInputStream...