Floyd
Floyd
With the addition of `data_types` in #219, we no longer need kwargs like:https://github.com/fhightower/ioc-finder/blob/e531ca13b77df941202afa8e96c61082bdcf6cad/ioc_finder/ioc_finder.py#L410. We should deprecate and then ticket the removal of these kwargs.
The `root_key_list` variable in the grammar file should be moved to the data file
Currently, the [scheme-less URL grammar](https://github.com/fhightower/ioc-finder/blob/609ff1a9bfd6f391d638d7f2b711983ae1de65f7/ioc_finder/ioc_grammars.py#L127) finds either URLs w/ a scheme or URLs w/o one. Update this grammar to only find URLs w/o a scheme.
Created from #196. We may also want to consider not parsing complete email addresses unless specifically asked to.
When given text, consider removing any word that will not contain an ioc. For example, can we safely remove all words that are only letters and shorter than 32 characters...
Read through the docs to identify areas for improvement/updating.
Fixes #237 . This change provides a speed-up of ≈10%
With the addition of the `included_ioc_types` arg, there are now imcompatible arguments (e.g. if `domains` not in `included_ioc_types`, but `parse_domain_from_url` is True). Improve the handling of these cases and/or documentation...