metacrafter

Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable and flexible rules.

Results 19 metacrafter issues

This PR was automatically created by Snyk using the credentials of a real user. Snyk has created this PR to fix one or more vulnerable packages in the `pip` dependencies of...

Hi, I'm in the process of setting up Datahub (https://datahubproject.io) at our organisation and I wanted to know if there is a way to load the Metacrafter PII labels onto...

enhancement

Add support for the following NoSQL databases and search engines: MongoDB, ArangoDB, Milvus, ArcadeDB, ElasticSearch, OpenSearch, MeiliSearch, Apache Cassandra, StarGate (MongoDB-like API over NoSQL databases). The current state of database...

enhancement

Error `Object of type bytes is not JSON serializable` caused by table fields with bytes type. Better type detection is needed, along with serialization of the bytes type in the JSON report. Error...

bug
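
The error in the issue above is what the standard `json` module raises when it meets a `bytes` value. As a minimal sketch of one possible workaround (not Metacrafter's actual fix), a `default` hook can convert bytes to text before serialization. The `encode_bytes` helper, the `base64:` prefix, and the sample `report` dictionary are hypothetical and chosen only for illustration.

```python
import base64
import json


def encode_bytes(value):
    """Fallback for json.dumps: convert bytes-like values to text.

    Hypothetical helper, not part of Metacrafter.
    """
    if isinstance(value, (bytes, bytearray)):
        raw = bytes(value)
        try:
            # Prefer readable text when the bytes are valid UTF-8.
            return raw.decode("utf-8")
        except UnicodeDecodeError:
            # Otherwise fall back to base64 with an assumed marker prefix.
            return "base64:" + base64.b64encode(raw).decode("ascii")
    # Preserve the standard error for any other unsupported type.
    raise TypeError(
        f"Object of type {type(value).__name__} is not JSON serializable"
    )


# Illustrative report fragment with one decodable and one binary value.
report = {"field": "payload", "sample": b"hello", "raw": b"\xff\xfe"}
serialized = json.dumps(report, default=encode_bytes)
print(serialized)
```

Whether binary fields should be decoded, base64-encoded, or skipped in the report is a design choice this sketch does not settle.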

Error processing SQLite database with non-unicode names for fields. Example [000012_world.zip](https://github.com/apicrafter/metacrafter/files/9306204/000012_world.zip) `Traceback (most recent call last): File "C:\Users\ibegt\AppData\Roaming\Python\Python310\site-packages\sqlalchemy\engine\result.py", line 1284, in fetchall l = self.process_rows(self._fetchall_impl()) File "C:\Users\ibegt\AppData\Roaming\Python\Python310\site-packages\sqlalchemy\engine\result.py", line 1230, in...

bug

Right now the JSON file of the metadata scanning report is not structured well enough. Improvements should include: - [ ] Add Cerberus schema (more info https://docs.python-cerberus.org) - [ ] Add...

enhancement

Error processing several SQLite files `(sqlite3.OperationalError) no such tokenizer: PSITokenizer` Example file [001607_psi.zip](https://github.com/apicrafter/metacrafter/files/9313446/001607_psi.zip)

bug

Right now the report includes only: field name, data type, tags, semantic type ID and registry URL. Sometimes additional information is required, and it is collected during the matching process. Consider adding to...

enhancement

Support XML files with the following list of tasks: - [x] Support XML files with XML tag name provided - [ ] Add examples to documentation - [ ] Collect examples...

enhancement

Automate detection of empty values and exclude them from data analysis. Possible empty values: None, 'N/A', empty string, 'NaN', 'None', '-'. The following actions are required: - [ ] Add...

enhancement
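
The empty-value filtering described in the issue above could be sketched roughly as follows. This is an illustration under stated assumptions, not Metacrafter's implementation: the names `EMPTY_MARKERS`, `is_empty`, and `drop_empty` are hypothetical, and the case-insensitive, whitespace-trimmed comparison and the handling of float NaN go beyond what the issue specifies.

```python
# String markers taken from the issue text, stored lowercase.
EMPTY_MARKERS = {"", "n/a", "nan", "none", "-"}


def is_empty(value):
    """Return True if the value should be treated as empty."""
    if value is None:
        return True
    # Assumption: a float NaN counts as empty (NaN is never equal to itself).
    if isinstance(value, float) and value != value:
        return True
    if isinstance(value, str):
        # Assumption: ignore surrounding whitespace and letter case.
        return value.strip().lower() in EMPTY_MARKERS
    return False


def drop_empty(values):
    """Keep only the values that are not considered empty."""
    return [v for v in values if not is_empty(v)]


column = [None, "N/A", "", "NaN", "None", "-", "Moscow", 0, " x "]
print(drop_empty(column))
```

Note that `0` is kept: only the listed markers are treated as empty, so legitimate falsy values still reach the analysis.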