
Improve markdown structure of docs llms.txt & Actor detail

Open jbartadev opened this issue 4 months ago • 14 comments

Epic for improving the current state of the llms.txt file (https://docs.apify.com/llms.txt) and the Actor detail markdown version (e.g., https://apify.com/apify/web-scraper.md)

jbartadev avatar Aug 11 '25 13:08 jbartadev

Hey team, please keep me posted on the progress. I'll be happy to test the new version.

jancurn avatar Aug 20 '25 11:08 jancurn

One more piece of feedback:

Let's add the following text to the very top of https://docs.apify.com/llms.txt, so that this info is easy for everyone to find. Let's also add some basic text there to introduce the Apify platform and the docs.

The full Markdown export of the docs is available at: https://docs.apify.com/llms-full.txt


jancurn avatar Aug 20 '25 22:08 jancurn

Guys, what is the status of this project? It seems there has been no progress over the past 2 weeks.

jancurn avatar Sep 02 '25 13:09 jancurn

@jancurn We merged a continuation a few hours ago: https://github.com/apify/apify-docs/pull/1796. We also created a ticket for fixing the description, as it doesn't seem to be an llms issue but rather a general Docusaurus one: https://github.com/apify/apify-docs/issues/1870

HonzaTuron avatar Sep 02 '25 13:09 HonzaTuron

Hey guys, please what is the status of this?

jancurn avatar Sep 17 '25 21:09 jancurn

> Hey guys, please what is the status of this?

Hey @jancurn, we've prioritized and fixed the main issues:

We're planning to work on the other details in the upcoming sprints

jbartadev avatar Sep 24 '25 09:09 jbartadev

@jbartadev thanks for the info and for the progress. There are still low-hanging fixes that I believe we could do:

See https://github.com/jancurn/test-llm-txt/blob/main/README.md for how the llms.txt file renders.

Please let's add this text to the top of llms.txt:

The entire content of Apify documentation is available in a single Markdown file at https://docs.apify.com/llms-full.txt

The word "Clients" leaks into the description text.


Some formatting also leaks into the descriptions. I think we could just strip all Markdown formatting from them and convert them to plain text; that would be better in all these cases.
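To make the plain-text suggestion concrete, here is a minimal sketch of stripping common inline Markdown from a short description. The helper name `markdown_to_plain_text` and the regex-based approach are assumptions for illustration only; the docs build may well prefer a proper Markdown parser, and this sketch does not handle nested or escaped syntax.

```python
import re


def markdown_to_plain_text(text: str) -> str:
    """Reduce a short Markdown snippet to plain text (hypothetical helper)."""
    # Images first, so they are not mistaken for links: ![alt](src) -> alt
    text = re.sub(r"!\[([^\]]*)\]\([^)]*\)", r"\1", text)
    # Links: [label](target) -> label
    text = re.sub(r"\[([^\]]+)\]\([^)]*\)", r"\1", text)
    # Inline code: `code` -> code
    text = re.sub(r"`([^`]*)`", r"\1", text)
    # Bold: **x** or __x__ -> x
    text = re.sub(r"(\*\*|__)(.+?)\1", r"\2", text)
    # Italic: *x* or _x_ -> x (word boundaries keep snake_case names intact)
    text = re.sub(r"(?<!\w)([*_])(.+?)\1(?!\w)", r"\2", text)
    # Heading and blockquote markers at the start of a line
    text = re.sub(r"^[ \t]{0,3}(#{1,6}|>)[ \t]+", "", text, flags=re.MULTILINE)
    # Collapse all runs of whitespace into single spaces
    return re.sub(r"\s+", " ", text).strip()


print(markdown_to_plain_text(
    "Use the **Apify CLI** to [deploy](https://docs.apify.com/cli) `Actors`."
))
```

The order matters: images are handled before links, and bold before italic, so the broader patterns do not eat the narrower ones.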


The https://docs.apify.com/cli.md file doesn't exist, even though https://docs.apify.com/cli does. Let's be consistent.

The alternate tag must contain an absolute URL.
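As a rough illustration of the absolute-URL point, the sketch below builds an alternate link tag from a page path. The function name `alternate_markdown_link`, the `BASE_URL` constant, and the `type="text/markdown"` attribute are assumptions, not taken from the actual docs build; only the `.md` suffix convention comes from the URLs mentioned in this thread.

```python
from html import escape
from urllib.parse import urljoin

BASE_URL = "https://docs.apify.com"  # assumed site origin


def alternate_markdown_link(page_path: str, base_url: str = BASE_URL) -> str:
    """Return a <link rel="alternate"> tag with an absolute href (hypothetical helper)."""
    # Map a page path such as /cli to its Markdown counterpart /cli.md
    md_path = page_path.rstrip("/") + ".md"
    # Resolve against the site origin so the href is never relative
    href = urljoin(base_url + "/", md_path.lstrip("/"))
    return f'<link rel="alternate" type="text/markdown" href="{escape(href, quote=True)}">'


print(alternate_markdown_link("/cli"))
```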


jancurn avatar Sep 25 '25 13:09 jancurn

One thing to consider:

AI agents usually don't send text/html in the Accept header, but text/markdown or text/plain instead. We can try to serve plain-text or Markdown content right away in that case.

See https://x.com/bunjavascript/status/1971934734940098971 and https://x.com/NathanFlurry/status/1972427441898717607

EDIT: Related Slack conversation: https://apify.slack.com/archives/CQ96RHG2U/p1759086275925279
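A minimal sketch of the content-negotiation idea, assuming the server can inspect the raw Accept header. The function name `prefers_markdown` and the tie-breaking rule (HTML wins unless a text type is ranked strictly higher) are illustrative assumptions; this is not how the linked PR necessarily implements it.

```python
def prefers_markdown(accept_header: str) -> bool:
    """True if text/markdown or text/plain outranks text/html (hypothetical helper)."""
    prefs = {}
    for part in accept_header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0  # a missing q parameter means full weight
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed weight as "not acceptable"
        prefs[media_type] = q

    html_q = prefs.get("text/html", 0.0)
    text_q = max(prefs.get("text/markdown", 0.0), prefs.get("text/plain", 0.0))
    # Wildcards such as */* are ignored on purpose, so browsers and generic
    # clients keep getting HTML.
    return text_q > html_q


print(prefers_markdown("text/markdown, text/plain;q=0.9"))
print(prefers_markdown("text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8"))
```

Ignoring wildcards keeps the default behavior unchanged for ordinary browsers, which seems like the safer choice when retrofitting this onto an existing site.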

HonzaTuron avatar Oct 07 '25 15:10 HonzaTuron

> AI agents usually don't send text/html in the Accept header, but text/markdown or text/plain instead. We can try to serve plain-text or Markdown content right away in that case.

FYI I am already working on that part.

B4nan avatar Oct 07 '25 15:10 B4nan

> FYI I am already working on that part.

Done via #1997

B4nan avatar Oct 10 '25 13:10 B4nan

Another observation: the order of items in llms.txt doesn't correspond to the order of pages in the docs menu.
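One possible way to align the two orderings, sketched under the assumption that both the llms.txt entries and the menu can be represented as simple lists. The `order_like_menu` name, the dict shape with a `url` key, and the sample paths are all hypothetical.

```python
def order_like_menu(entries, menu_urls):
    """Sort llms.txt entries by their position in the docs menu (hypothetical helper)."""
    position = {url: index for index, url in enumerate(menu_urls)}
    # sorted() is stable, so entries that are missing from the menu keep
    # their original relative order and end up after all known pages.
    return sorted(entries, key=lambda entry: position.get(entry["url"], len(menu_urls)))


# Illustrative data only; these are not the real menu contents.
menu = ["/platform", "/api", "/cli"]
entries = [
    {"url": "/cli", "title": "CLI"},
    {"url": "/unlisted", "title": "Unlisted page"},
    {"url": "/platform", "title": "Platform"},
    {"url": "/api", "title": "API"},
]
print([entry["url"] for entry in order_like_menu(entries, menu)])
```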

jancurn avatar Oct 16 '25 14:10 jancurn

More feedback: mailto links are malformed. E.g., see https://docs.apify.com/platform/integrations/actors/integration-ready-actors

jancurn avatar Oct 20 '25 22:10 jancurn

@jancurn I'm not sure what's wrong with them; the subject query param is a valid one in mailto: https://stackoverflow.com/a/4782084/7125721

HonzaTuron avatar Oct 21 '25 07:10 HonzaTuron

It should end up as [contact support](mailto:...) instead of just the mailto link itself.
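A small sketch of the expected output shape, assuming the exporter has access to the link label and target separately. The `mailto_markdown_link` helper and the example.com address are hypothetical and only illustrate keeping the label instead of emitting the bare target.

```python
from urllib.parse import quote


def mailto_markdown_link(label, address, subject=None):
    """Render a mailto link as a labeled Markdown link (hypothetical helper)."""
    target = f"mailto:{address}"
    if subject:
        # Percent-encode the subject so spaces do not break the link target
        target += "?subject=" + quote(subject)
    return f"[{label}]({target})"


print(mailto_markdown_link("contact support", "support@example.com", "Integration request"))
```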

B4nan avatar Oct 21 '25 07:10 B4nan