crossfeed icon indicating copy to clipboard operation
crossfeed copied to clipboard

Update robots.txt

Open cqueern opened this issue 2 years ago โ€ข 2 comments

Updated robots.txt so well-behaved search engine crawlers will not index Crossfeed instances, as I assume they're not meant to appear in search engine results due to the potentially sensitive information they collect.

๐Ÿ—ฃ Description

๐Ÿ’ญ Motivation and context

๐Ÿงช Testing

โœ… Pre-approval checklist

  • [x] This PR has an informative and human-readable title.
  • [x] Changes are limited to a single goal - eschew scope creep!
  • [ ] All future TODOs are captured in issues, which are referenced in code comments.
  • [ ] All relevant type-of-change labels have been added.
  • [ ] I have read the CONTRIBUTING document.
  • [x] These code changes follow cisagov code standards.
  • [x] All relevant repo and/or project documentation has been updated to reflect the changes in this PR.
  • [x] Tests have been added and/or modified to cover the changes in this PR.
  • [ ] All new and existing tests pass.

โœ… Pre-merge checklist

  • [ ] Revert dependencies to default branches.
  • [ ] Finalize version.

โœ… Post-merge checklist

  • [ ] Create a release.

cqueern avatar Dec 21 '23 19:12 cqueern

@cqueern Thanks for taking an interest in Crossfeed! You bring up an interesting point about whether or not we want to be indexed. I believe that allowing indexing was a deliberate decision, but I will look into it further.

Matthew-Grayson avatar Dec 21 '23 20:12 Matthew-Grayson

My pleasure! It's an important project.

Sounds good. If the decision is to exclude all robots from the entire server, suggest that line 3 in the robots.txt file remain

Disallow: /

If the decision is to allow all robots complete access, suggest that line 3 in the robots.txt file be updated to

Disallow:

cqueern avatar Dec 21 '23 20:12 cqueern