wordpress-activitypub icon indicating copy to clipboard operation
wordpress-activitypub copied to clipboard

Question: How does this plugin interact with moderation and trust & safety on the fediverse?

Open ThisIsMissEm opened this issue 2 years ago • 12 comments

Hi! I've had a look through the changelog & through open issues, and can't see anything about this topic.

Will this plugin implement support for handling Flag activities, additionally, will it support (at a bare minimum) a means through which to specify instances to not federate with? Or a way to block certain actors from interacting with the blog via the plugin?

Would be happy to provide advice either here or via the IFTAS matrix server.

ThisIsMissEm avatar Sep 13 '23 08:09 ThisIsMissEm

Hey @ThisIsMissEm, thanks for reaching out and offering your help!

WordPress has some basic functionality to help block messages based on certain criteria.

When a comment contains any of these words in its content, author name, URL, email, IP address, or browser’s user agent string, it will be put in the Trash. One word or IP address per line. It will match inside words, so “press” will match “WordPress”.

  • This will block every message from users that match one of the defined block words/tags/domains/eMails/WebFinger-IDs.
  • Besides of that, every incoming post will be handled by WordPress' spam filter/system and held for moderation.
  • And there are possibilities to manage/delet Followers.

This is not yet perfect and will not block the following process completely, so I would love to discuss further possibilities.

Thanks again!

pfefferle avatar Sep 13 '23 08:09 pfefferle

@ThisIsMissEm in addition to what Matthias has outlined, the plugin supports Authorized fetch.

It isn't yet documented but to enable it, you would set the ACTIVITYPUB_AUTHORIZED_FETCH const to true: define( 'ACTIVITYPUB_AUTHORIZED_FETCH', true )

mediaformat avatar Sep 13 '23 16:09 mediaformat

@mediaformat Right, but Authorized Fetch only works if you've a list of domains/instances from which you don't wish to approve fetches from.

@pfefferle so, is the extent of the moderation thinking currently just "we'll let wordpress handle comment moderation"? Because you might have problems with that due to a group known as "the worst of the worst" (essentially malicious actors), some tactics seen used have been hammering an instance with CSAM or other horribly graphic imagery, which means a person reviewing the comment moderation queue would still have to see those.

I just want to make sure that there's a defense in depth approach here, and that we can shield your users from the most harmful and vile instances.

ThisIsMissEm avatar Sep 13 '23 20:09 ThisIsMissEm

There are some WordPress mechanics that help atm. (WordPress strips all images from comments automatically for example) but I totally agree that there have to be some more possibilities! I am looking forward to discuss these options with you on the IFTAS server.

pfefferle avatar Sep 14 '23 05:09 pfefferle

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Jan 13 '24 01:01 github-actions[bot]

Just bumping this thread, offer of help & discussion is still valid

ThisIsMissEm avatar Jan 13 '24 02:01 ThisIsMissEm

Because you might have problems with that due to a group known as "the worst of the worst" (essentially malicious actors), some tactics seen used have been hammering an instance with CSAM or other horribly graphic imagery, which means a person reviewing the comment moderation queue would still have to see those.

I believe WordPress' "Disallowed Comment Keys" setting mentioned by @pfefferle above should help with that. Since the posts from flagged malicious actors are sent to the Trash directly, moderators would not see the comments in the moderation queue; they'd bypass that queue altogether.

jeherve avatar Apr 16 '24 07:04 jeherve

Since the posts from flagged malicious actors are sent to the Trash directly, moderators would not see the comments in the moderation queue; they'd bypass that queue altogether.

Ideally there should be a way to configure it so the malicious actors wouldn't be able to follow the WordPress blog at all to begin with though, right? The same as if you'd entirely defederated from them and/or had authorised fetch turned on with an explicit allow-list.

VirtualWolf avatar Jun 09 '24 11:06 VirtualWolf