Adrien Barbaresi

Results 412 comments of Adrien Barbaresi

Hi @juanjoDiaz, thanks for the feedback, that's odd indeed. Words written in all caps currently remain untouched in case they are acronyms (e.g. BRICS). That being said it is safe...

Thanks for the added docs and good point above, you could actually open an issue regarding the harmonization of `Lemmatizer` and `LanguageDetector`. It's not the priority now though, so we...

- CSV: add column - JSON: add in metadata - XML: add attribute - XML TEI: `profileDesc/creation/date[@type="download"]`

Thank you for your feedback, the output is weird because the text is contained by a `` element which isn't found by rule-based XPath expressions because it's rare or not...

@felipehertzer Can you try adding it to your PR in #509? `ends-with(@class, '-article')` could work, I don't remember if it's supported by LXML.

@felipehertzer Yes, let's try that.

Hi @edkrueger, thanks for your feedback. I agree there is something missing here but in my opinion it's more a documentation issue. Rather than risking to break existing code I'd...

The problem is that `fetch_url()` is mentioned in various places on the Internet (not just in the docs) with its current behavior and people are going to wonder why it...

@edkrueger What do you think and do you have time to work on the PR?

To sum up, here is what I'd suggest in order to implement useful changes step-by-step: 1. Create a new function to replace the `decode=False` behavior, say `fetch_response()` *hint: a new...