Archon icon indicating copy to clipboard operation
Archon copied to clipboard

🐛 [Bug]: Archon does not run "Code Example" Crawler when file is uploaded (vs URL/Website).

Open anow2 opened this issue 4 months ago • 3 comments

Archon Version

latest

Bug Severity

🟢 Low - Minor inconvenience

Bug Description

Archon does not run "Code Example" Crawler when file is uploaded (vs URL/Website).

I was able to get it to run when I pointed it to a hosted version of my markdown API documentation.

Steps to Reproduce

Knowledge Base Add Knowledge Upload File Upload markdown file "xxxxxxx.md" Watch Crawler skip code extraction

Expected Behavior

Code Extraction should have been ran.

Actual Behavior

Crawler skips code extraction

Error Details (if any)

None

Affected Component

🔍 Knowledge Base / RAG

Browser & OS

Chrome

Additional Context

No response

Service Status (check all that are working)

  • [x] 🖥️ Frontend UI (http://localhost:3737)
  • [x] ⚙️ Main Server (http://localhost:8181)
  • [x] 🔗 MCP Service (localhost:8051)
  • [x] 🤖 Agents Service (http://localhost:8052)
  • [x] 💾 Supabase Database (connected)

anow2 avatar Aug 29 '25 01:08 anow2

Did you change the type of the crawl to technical documentation? If it isn't technical we skip the code extraction on purpose!

coleam00 avatar Aug 30 '25 13:08 coleam00

Did you change the type of the crawl to technical documentation? If it isn't technical we skip the code extraction on purpose!

Yes - Technical/Coding was chosen.

anow2 avatar Sep 01 '25 18:09 anow2

I can confirm this is an issue, technical type was chosen. Would be very nice to have tho.

Example scenario: We have confluence with code examples on how to connect to our internal API's. These webpages are not crawlable since they are protected by sophisticated login. I would download the page as pdf and upload it into archon. This doesn't support the code example extraction.

fonteijne avatar Sep 16 '25 12:09 fonteijne