intelmq ENH: Caching, advanced mapping and separating events for MISP Feed output bot

MISP Feed output bot got new features:

caching incoming messages to not re-generate on every run;
an ability to select which fields should be mapped and set additional parameters for attributes;
an ability to group messages in different MISP events, based on a field.

The bot is fully backward-compatible. By default, the previous behaviour is kept.

In addition, code related to Python 3.6 was removed and the message library was fixed not to modify the original dict instance.

This PR replaces on PR #2505.

Jul 04 '24 14:07 kamil-certat

Yeah, finally green :tada:

Jul 08 '24 11:07 kamil-certat

Are you using it in production?

Jul 08 '24 11:07 sebix

We started using it on staging, found some pain points, and now I'll test it & hopefully promote to prod in a few days - so, not yet, but soon ;)

Jul 08 '24 12:07 kamil-certat

There's not much I can check here without reading the MISP code and docs and setting up an instance.

Maybe @Rafiot, can you have a glimpse?

Jul 09 '24 07:07 sebix

My main request is:

give the possibility (or document it) how to add default tags to events (such as "FEED:Source=intelmq") to any event which gets generated via this feed. Adding a default tag helps in sorting out automatically added events and attributes in case something goes wrong.
make sure the users understand in the documentation how feeds work. Linking to the MISP documentation and what each parameter when including the MISP feed actually means. For example, did you know that if you set "caching:enabled" in a MISP feed, then that means the feed events and attributes don't get added to the events (and attributes) table in MISP. It's totally non-intuitive. Why am I saying this? I had to figure it out the hard way myself.
basically we need a tutorial on docs.intelmq.org on how to run this properly! It's a very cool feature and addition but you can quickly shoot yourself in the foot if you don't know how it works and what every parameter in MISP means.

Jul 09 '24 08:07 aaronkaplan

Thanks for the review!

re 1: It's a good idea, I'll look into it. I was concentrated on what we actually need, and I do think there are more things we can improve in the bot. re 2: It's a good point, I will point to the MISP documentation. I also had to explore what we really can do, the PyMISP documentation wasn't really helpful :D re 3: yeah, I agree it needs a tutorial (and I've planned it), but it's a separated thing, I won't include it here. My plan is to write a blog post once we establish the final integration internally, and then we could include it in the documentation as well. I expect to publish it around September, hopefully. There is a queue of what I'd like to write about, and a limited time ;)

Jul 09 '24 09:07 kamil-certat

@aaronkaplan

Could you have a look again? I have implemented tagging as well as rewritten the documentation, added config examples, implemented validation in the check method and refactored code, including renaming methods to be more clear what they do or adding doc strings where I found it useful. I have also added descriptions to complex test cases explaining what should happen.

I do admit that the configuration of the bot is complex. I did my best to marriage flexibility and readability, but I think in the future we may eventually redesign this configuration, based on feedback. I'm also open for any suggestions.

Jul 16 '24 12:07 kamil-certat

@aaronkaplan you requested some changes to this PR and @kamil-certat implemented various changes based on your feedback. Can you please re-check?

Aug 27 '24 09:08 sebix

@aaronkaplan this PR is waiting for your approval

Aug 29 '25 11:08 sebix

Still need to test against a misp server

Oct 23 '25 21:10 aaronkaplan

rebased on develop. there were quite some conflicts to fix.

Oct 24 '25 09:10 sebix

now the JSON parser test fails, the output differs to the expected one.

The PR contains changes in intelmq/lib/message.py. I don't know yet if this is a rebase error or an error in the code.

Oct 24 '25 09:10 sebix

Still need to test against a misp server

As discussed yesterday, I'll take this over

Oct 24 '25 09:10 sebix

I'm working on a few improvements / additional requirements right now, a new code should come today or tomorrow. I'll then address comments

Oct 27 '25 09:10 kamil-certat

Documentation for the new parameter flat_events is missing

Oct 27 '25 15:10 sebix

I'm aware of that, I'll adjust documentation tomorrow

Oct 27 '25 15:10 kamil-certat

I also have to add matching unit tests ;)

Oct 27 '25 15:10 kamil-certat

BTW, all methods were documented in bot-development :thinking:

Oct 28 '25 09:10 kamil-certat

BTW, all methods were documented in bot-development 🤔

Oh, sorry. I don't know where I was looking.

Oct 28 '25 09:10 sebix