mcrit
The MinHash-based Code Relationship & Investigation Toolkit (MCRIT) is a framework created to simplify the application of the MinHash algorithm in the context of code similarity.
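For readers unfamiliar with MinHash, the idea can be sketched in a few lines. This is a generic illustration of the technique, not MCRIT's implementation: the function names, the use of `hashlib.blake2b` with a per-function salt, and the instruction strings used as tokens are all assumptions made for this example.

```python
import hashlib


def minhash_signature(tokens, num_hashes=64):
    """Return a MinHash signature for a non-empty collection of string tokens.

    For each of `num_hashes` salted hash functions, keep the smallest hash
    value seen over all tokens.
    """
    signature = []
    for seed in range(num_hashes):
        # A different salt per seed simulates an independent hash function.
        salt = seed.to_bytes(4, "little")
        signature.append(
            min(
                int.from_bytes(
                    hashlib.blake2b(
                        token.encode("utf-8"), digest_size=8, salt=salt
                    ).digest(),
                    "little",
                )
                for token in tokens
            )
        )
    return signature


def estimate_similarity(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)
```

Two token sets that share most of their elements agree in most signature slots, so comparing fixed-size signatures approximates the Jaccard similarity of the underlying sets without comparing the sets themselves.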
While querying a sample, the memory usage of a single worker often jumps to tens of GB, sometimes even more than 60 GB. It seems like there's...
* think of orphan query samples and functions that don't have a job connected to them
* think of doing a DB compact afterwards
Hey :)! It would be cool if the family summary included the actors associated with it, somewhat similar to the way Malpedia presents this info: It does change...
* Check if something like this can be used to auto-generate documentation for Falcon:
  * https://github.com/Javlopez/falcon-apispec
* complete doc strings and typing in McritClient
For various purposes, it might be worthwhile to introduce and support universal tagging on the level of families, samples, functions, possibly also matching reports.
By transforming additional DTOs into full python dataclasses, it would likely become possible to use an acceleration library like [mashumaro](https://github.com/Fatal1ty/mashumaro) for the (un)marshalling of MatchingResult, which is very expensive for...
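As a rough illustration of what a DTO as a full dataclass could look like, here is a minimal sketch using only the standard library. The class name `MatchEntry` and its fields are hypothetical and do not come from the MCRIT code base. With mashumaro, the hand-written `to_dict`/`from_dict` pair would presumably be replaced by inheriting from its `DataClassDictMixin`, which generates equivalent methods.

```python
from dataclasses import asdict, dataclass


@dataclass
class MatchEntry:
    # Hypothetical fields, chosen only for illustration.
    function_id: int
    matched_function_id: int
    score: float

    def to_dict(self):
        """Marshal the entry into a plain dict (e.g. for JSON output)."""
        return asdict(self)

    @classmethod
    def from_dict(cls, data):
        """Unmarshal an entry from a plain dict, coercing field types."""
        return cls(
            function_id=int(data["function_id"]),
            matched_function_id=int(data["matched_function_id"]),
            score=float(data["score"]),
        )
```

Because the field list and types are declared once on the class, a code-generating library has everything it needs to produce specialized (un)marshalling code per DTO.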
Hey! We see MCRIT as a great tool for malware similarity purposes and want to see if it can be integrated into our malware pipeline, with emphasis on the API...
`MongoDbStorage`'s `insert_many` method should probably check for the total size of the documents or if one of the documents is too big itself. In some (pretty rare) cases, the size...
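One possible shape for such a check is sketched below. It is not taken from `MongoDbStorage`: the helper names, the batch limit default, and the JSON-based size estimate are assumptions for illustration. The 16 MiB constant is MongoDB's maximum BSON document size; real code would more likely measure `len(bson.encode(document))` than a JSON approximation.

```python
import json

# MongoDB rejects single BSON documents larger than 16 MiB.
MAX_DOCUMENT_BYTES = 16 * 1024 * 1024


def approximate_size(document):
    """Assumption: JSON length as a dependency-free stand-in for BSON size."""
    return len(json.dumps(document, default=str).encode("utf-8"))


def split_into_batches(
    documents,
    max_batch_bytes=MAX_DOCUMENT_BYTES,  # hypothetical default batch budget
    max_document_bytes=MAX_DOCUMENT_BYTES,
    size_of=approximate_size,
):
    """Group documents into size-bounded batches.

    Returns (batches, oversized): batches whose summed size stays within
    max_batch_bytes, and the documents that exceed max_document_bytes on
    their own and therefore need separate handling.
    """
    batches, oversized = [], []
    current, current_bytes = [], 0
    for document in documents:
        size = size_of(document)
        if size > max_document_bytes:
            oversized.append(document)
            continue
        # Start a new batch when adding this document would exceed the budget.
        if current and current_bytes + size > max_batch_bytes:
            batches.append(current)
            current, current_bytes = [], 0
        current.append(document)
        current_bytes += size
    if current:
        batches.append(current)
    return batches, oversized
```

Each returned batch could then be passed to `insert_many` separately, while the oversized documents are logged, truncated, or stored by some other means.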
* possibly make it installable via pip
* bind starting the plugin to a shortcut
Fixes #76.
Should anything be done in the else branch on line 127? Maybe mark the job as failed?