Ivan Begtin
Consider adding table detection and extraction for non-tabular files such as the .docx file format, for example using the docx2csv command https://github.com/ivbeg/docx2csv. Review other possible data sources and data formats.
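A minimal sketch of what table extraction from .docx could look like, using only the standard library. It relies on the fact that a .docx file is a ZIP archive whose main part is `word/document.xml`. The function names `extract_tables` and `table_to_csv` are hypothetical, and this is not how docx2csv is implemented.

```python
import csv
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx archive.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_tables(docx_file):
    """Return every table in a .docx as a list of rows of cell strings.

    docx_file may be a path or a binary file-like object.
    Nested tables are returned as separate tables, and their text
    also appears in the enclosing cell of the parent table.
    """
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    tables = []
    for tbl in root.iter(W + "tbl"):
        rows = []
        for tr in tbl.findall(W + "tr"):
            cells = []
            for tc in tr.findall(W + "tc"):
                # A cell's text is spread over one or more <w:t> runs.
                cells.append("".join(t.text or "" for t in tc.iter(W + "t")))
            rows.append(cells)
        tables.append(rows)
    return tables


def table_to_csv(rows):
    """Serialize one extracted table as CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()
```

Merged cells, rows wrapped in content controls, and paragraph breaks inside a cell are ignored here; a real command would need to handle them.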
Add support for ORC files https://cwiki.apache.org/confluence/display/hive/languagemanual+orc
Rewrite the code to make the data conversion processes universal. Right now this is partially implemented with the IterableData and DataWriter classes, but they aren't used in the data conversion functions. Ideas on how to...
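One possible shape for a universal conversion path, sketched with plain functions: every reader yields dict records and every writer consumes them, so any reader can be paired with any writer. All names here (`READERS`, `WRITERS`, `convert`, and the reader/writer functions) are hypothetical and do not reflect the actual IterableData or DataWriter interfaces.

```python
import csv
import io
import json


def read_csv(stream):
    """Yield each CSV row as a dict keyed by the header row."""
    yield from csv.DictReader(stream)


def read_jsonl(stream):
    """Yield one dict per non-empty JSON Lines row."""
    for line in stream:
        if line.strip():
            yield json.loads(line)


def write_csv(records, stream):
    """Write records as CSV; buffers them so the header covers every key."""
    records = list(records)
    fields = []
    for rec in records:
        for key in rec:
            if key not in fields:
                fields.append(key)
    writer = csv.DictWriter(stream, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)


def write_jsonl(records, stream):
    """Write one JSON object per line."""
    for rec in records:
        stream.write(json.dumps(rec) + "\n")


# Registries: adding a format means adding one reader and/or one writer.
READERS = {"csv": read_csv, "jsonl": read_jsonl}
WRITERS = {"csv": write_csv, "jsonl": write_jsonl}


def convert(src, src_format, dst, dst_format):
    """Convert between any registered pair of formats."""
    WRITERS[dst_format](READERS[src_format](src), dst)
```

With this layout, supporting N formats takes N readers and N writers instead of a separate function for each pair of formats.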
Transfer the code from https://github.com/datacoon/datadifflib and implement diff and apply commands to generate patches for, and apply them to, JSONL, BSON, and CSV files
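To illustrate the idea of diff and apply commands, here is a rough sketch for JSONL data where every record has a unique key. The patch layout (`op`, `key`, `record`) and the function names are assumptions made for this example; they are not the datadifflib format or API.

```python
import json


def index_jsonl(text, key="id"):
    """Parse JSON Lines text into a dict of records indexed by `key`."""
    records = (json.loads(line) for line in text.splitlines() if line.strip())
    return {rec[key]: rec for rec in records}


def diff(old, new):
    """Build a patch (list of operations) that turns `old` into `new`."""
    patch = []
    for k, rec in new.items():
        if k not in old:
            patch.append({"op": "add", "key": k, "record": rec})
        elif old[k] != rec:
            patch.append({"op": "change", "key": k, "record": rec})
    for k in old:
        if k not in new:
            patch.append({"op": "remove", "key": k})
    return patch


def apply_patch(records, patch):
    """Return a new dict with the patch applied; `records` is not modified."""
    result = dict(records)
    for entry in patch:
        if entry["op"] == "remove":
            result.pop(entry["key"], None)
        else:
            # "add" and "change" both store the full new record.
            result[entry["key"]] = entry["record"]
    return result
```

Because the patch is itself a list of plain dicts, it could be stored as a JSONL file too. CSV and BSON inputs would only need a different loader that produces the same keyed dict.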
More commands needed:
- sort - sort by number of columns
- merge - merge two or more data files
- join - join two or more data files
- ...
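A small sketch of how the sort, merge, and join operations could behave on in-memory records (lists of dicts). It assumes that sort orders records by chosen columns, that merge concatenates files, and that join is an inner join on a shared key column; the function names and these semantics are assumptions, not a specification of the commands.

```python
def sort_records(records, columns):
    """Return records ordered by the given columns, in priority order."""
    return sorted(records, key=lambda rec: tuple(rec[c] for c in columns))


def merge_records(*datasets):
    """Concatenate two or more datasets into one list of records."""
    merged = []
    for dataset in datasets:
        merged.extend(dataset)
    return merged


def join_records(left, right, key):
    """Inner join: combine records from both sides that share `key`."""
    index = {}
    for rec in right:
        index.setdefault(rec[key], []).append(rec)

    joined = []
    for rec in left:
        for match in index.get(rec[key], []):
            # On a column name clash, the value from `right` wins.
            joined.append({**rec, **match})
    return joined
```

For example, joining `[{"id": 1, "name": "Ann"}]` with `[{"id": 1, "city": "Oslo"}]` on `"id"` gives one record that contains all three columns.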
For example, use the Google search [status page site:status.\*.\*](https://www.google.com/search?q=status+page+site%3Astatus.*.*) or public lists of major companies.
Right now it's just a simple alphabetical list of public status pages. One idea is to reorganize it into groups such as hosting, CDNs, finance, etc. Groups should be sorted...
This PR was automatically created by Snyk using the credentials of a real user. Snyk has created this PR to fix one or more vulnerable packages in the `pip` dependencies of...