🐛 [Bug]: 409 Conflict Spam When Processing a Crawl

Open phill-bramble opened this issue 4 months ago • 1 comment

Archon Version

main pull as of 20/08/2025

Bug Severity

🟑 Medium - Affects functionality

Bug Description

My server log is full of 409 Conflict errors when processing crawls.

Steps to Reproduce

Crawl any site

Expected Behavior

Successful posts to Supabase

Actual Behavior

2025-08-20 15:48:51 | httpx | INFO | HTTP Request: POST http://host.docker.internal:8000/rest/v1/archon_crawled_pages⁠ "HTTP/1.1 409 Conflict"

2025-08-20 15:48:51 | search | ERROR | Failed individual insert for https://nextjs.org/docs/app/api-reference/functions/unstable_noStore:⁠ {'message': 'duplicate key value violates unique constraint "archon_crawled_pages_url_chunk_number_key"', 'code': '23505', 'hint': None, 'details': 'Key (url, chunk_number)=(https://nextjs.org/docs/app/api-reference/functions/unstable_noStore, 7) already exists.'}

2025-08-20 15:48:51 | search | INFO | Individual inserts: 12/25 successful

Error Details (if any)


Affected Component

🔍 Knowledge Base / RAG

Browser & OS

N/A

Additional Context

No response

Service Status (check all that are working)

  • [x] 🖥️ Frontend UI (http://localhost:3737)
  • [x] ⚙️ Main Server (http://localhost:8181)
  • [x] 🔗 MCP Service (localhost:8051)
  • [x] 🤖 Agents Service (http://localhost:8052)
  • [x] 💾 Supabase Database (connected)

phill-bramble, Aug 20 '25 15:08

We still have this issue by design; we should catch duplicates earlier in the crawling process. Keeping it in the backlog for now, as it does not break any critical flows.

Thank you for reporting this, @phill-bramble.

Wirasm, Sep 04 '25 13:09
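
The reply above suggests catching duplicates earlier in the crawling process. As a rough illustration of that idea only, and not Archon's actual code, the sketch below drops rows that repeat a `(url, chunk_number)` pair before a batch is inserted. The function name `dedupe_chunks` and the row layout (a dict with `url`, `chunk_number`, and `content` keys) are assumptions made for the example; only the two key columns are taken from the constraint name in the log.

```python
from typing import Iterable


def dedupe_chunks(rows: Iterable[dict]) -> list[dict]:
    """Keep the first row seen for each (url, chunk_number) pair.

    Hypothetical helper: the key columns mirror the unique constraint
    reported in the log, everything else is illustrative.
    """
    seen: set[tuple[str, int]] = set()
    unique: list[dict] = []
    for row in rows:
        key = (row["url"], row["chunk_number"])
        if key in seen:
            # A later row with the same key would violate the constraint.
            continue
        seen.add(key)
        unique.append(row)
    return unique


# Example batch with one repeated (url, chunk_number) pair.
rows = [
    {"url": "https://example.com/docs", "chunk_number": 7, "content": "first"},
    {"url": "https://example.com/docs", "chunk_number": 7, "content": "repeat"},
    {"url": "https://example.com/docs", "chunk_number": 8, "content": "next"},
]
unique_rows = dedupe_chunks(rows)
print(len(unique_rows))
```

This only removes duplicates inside a single batch. Rows that already exist in the table from an earlier crawl would still conflict and would need to be handled on the database side.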