Ivan Begtin
Ivan Begtin
Hi! I'am building public[ registry](https://github.com/apicrafter/metacrafter-registry) of semantic data types similar to PRONOM for data formats. Is there any document or code file with list of semantic data types supported by...
I have been a data engineer obsessed with metadata for a long time. Recently I started the semantic data types identification project metacrafter - https://github.com/apicrafter/metacrafter, and I wrote a short...
Local files date extraction should be supported too. Required to write proper tests
Instead of dynamic page structure identification generate a template with a number of options that should simplify data parsing afterward. It should include: - location of the container tag -...
The current rule is to use the first link by default. It doesn't work well. Example URL http://pskenergo.ru/news/branch/ instead of a post URL, each time a category URL is detected....
URL https://inspire.ec.europa.eu/news Example: `Monday, January 31, 2022` Need to update qddate patterns
This PR was automatically created by Snyk using the credentials of a real user.Snyk has created this PR to fix one or more vulnerable packages in the `pip` dependencies of...
This PR was automatically created by Snyk using the credentials of a real user.Snyk has created this PR to fix one or more vulnerable packages in the `pip` dependencies of...
Add support of Avro files https://avro.apache.org/docs/1.2.0/spec.html