crawl4ai
crawl4ai copied to clipboard
Documentation commands fail with 404 error (missing llm.txt)
Issue Description
When using the crawl4ai command-line tool, I encountered the following issues with the documentation commands: 1. Running crawl docs update or crawl docs index results in a 404 error:
[ERROR]... × Failed to fetch docs: 404 Client Error: Not Found for url: https://api.github.com/repos/unclecode/crawl4ai/contents/docs/llm.txt Error: 404 Client Error: Not Found for url: https://api.github.com/repos/unclecode/crawl4ai/contents/docs/llm.txt
2. Running crawl docs list displays an empty table for document sections:
+------------+ | Sections | +------------+ +------------+
Steps to Reproduce 1. Install Crawl4AI (version 0.4.247) via pip:
pip install crawl4ai
2. Activate the Conda environment:
conda activate crawl4ai_env
3. Run the following commands:
• crawl docs update
• crawl docs index
• crawl docs list
4. Observe the error messages and results.
Expected Behavior • crawl docs update and crawl docs index should run successfully and load the documentation content. • crawl docs list should display the available document sections.
Actual Output 1. Error Logs from crawl docs update and crawl docs index:
[ERROR]... × Failed to fetch docs: 404 Client Error: Not Found for url: https://api.github.com/repos/unclecode/crawl4ai/contents/docs/llm.txt Error: 404 Client Error: Not Found for url: https://api.github.com/repos/unclecode/crawl4ai/contents/docs/llm.txt
2. Output from crawl docs list:
+------------+ | Sections | +------------+ +------------+
Environment Information • OS: macOS • Python Version: 3.10 • Crawl4AI Version: 0.4.247 • Installation Method: pip install crawl4ai
Additional Information 1. Repository Content: • The docs directory in the GitHub repository does not contain the llm.txt file, which the tool attempts to access:
https://api.github.com/repos/unclecode/crawl4ai/contents/docs/llm.txt
2. Potential Impact:
• This issue appears to affect only the documentation commands. Other features (e.g., crawling webpages) seem unaffected, but I would appreciate confirmation.
3. Troubleshooting Attempts:
• Checked the docs directory on the GitHub repository to verify the presence of the llm.txt file.
• Verified the tool’s version and environment setup.
Suggested Fix or Guidance
Could you provide clarification on: 1. Whether the documentation feature is fully implemented or under development. 2. Any additional steps required to enable this functionality. 3. If llm.txt is not intended to be part of the repository, should this feature be disabled or modified?
您可以复制此模板并提交到 Crawl4AI GitHub Issues 页面。如需进一步调整,请告诉我开发者的具体反馈要求!