Tim Allison

Results: 93 comments by Tim Allison

Can you guarantee that reading per line will be ok on this json-disaster? If so, that's the way to go. The other thing is that you'll want to specify the...

Can you tell if they're writing UTF-8? Are there any accented or other non-ASCII characters in the data that you can use to figure out what their default encoding is?
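One way to act on that question is to run a strict UTF-8 decode over a sample of the raw bytes. The following is a minimal sketch using only the JDK; the class and method names (`EncodingCheck`, `isValidUtf8`) are hypothetical and not part of Tika or Jackson. Note that pure-ASCII input always passes, so the check only tells you something when the sample contains accented or other non-ASCII characters.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only.
final class EncodingCheck {

    // Returns true if the bytes decode cleanly as UTF-8, false otherwise.
    static boolean isValidUtf8(byte[] bytes) {
        // REPORT makes the decoder throw on bad input instead of
        // silently substituting the replacement character.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }
}
```

If the check fails on bytes that are known to hold accented text, the writer is probably using a single-byte legacy encoding (for example windows-1252 or ISO-8859-1) rather than UTF-8; which one it is still has to be worked out from the data.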

If NaN is the only problem, is there any way to tell Jackson to be lax? Maybe something like: https://stackoverflow.com/questions/63516411/convert-nan-value-into-null-when-parsing-json-using-jackson-library

No, that probably won't work. Sorry. If you send me some examples, I can try some things.

@Gagravarr any objections? It would be useful to open a ticket on our JIRA to track this. It would also be helpful to add unit tests with the files that you've provided....

I realize this will require quite a bit more refactoring. Would it make sense to create a new parser DWGExecParser that runs dwgread if it is installed, otherwise falls back...

Oh, and if you're on a Mac, double-click the quine.gz for entertainment. I've reported it to Apple, but as far as I'm aware they haven't fixed it yet. Make sure to have...

Upgrading Tika early and often is a good idea. Let me know if you want to chat about migrating to >= 2.1.0.

If you have time, I'd recommend going straight to 2.4.1. There aren't that many diffs/changes within 2.x. This is the documentation we've put together: https://cwiki.apache.org/confluence/display/TIKA/Migrating+to+Tika+2.0.0 The 2.5.0 release should happen...

Any chance you'd be willing to share your test file? We just added a dbf parser on [Apache Tika](https://issues.apache.org/jira/browse/TIKA-1513) and we need examples of Float fields.