torchtune
Plans for fp8 tuning going forward? E.g., DeepSeek-V3
As foundation models move towards being trained in 8-bit (fp8) precision, is there a plan on the roadmap to support this kind of training?
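For context on what fp8 training looks like in the PyTorch ecosystem today, here is a minimal sketch using torchao's float8 training API (`torchao.float8.convert_to_float8_training`). This is not torchtune's integration, just an illustration of the general approach; it assumes a recent torchao install and fp8-capable hardware (e.g., H100).

```python
# Sketch only: swaps eligible nn.Linear layers for float8 variants
# that run their matmuls in fp8 with dynamic scaling, while the
# rest of the model stays in high precision.
import torch
import torch.nn as nn
from torchao.float8 import convert_to_float8_training

model = nn.Sequential(
    nn.Linear(4096, 4096),
    nn.ReLU(),
    nn.Linear(4096, 4096),
).to(device="cuda", dtype=torch.bfloat16)

# Requires fp8-capable hardware (e.g., compute capability >= 8.9).
convert_to_float8_training(model)

# Training then proceeds as usual; gradients flow through the fp8 matmuls.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(8, 4096, device="cuda", dtype=torch.bfloat16)
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()
```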
Related to DeepSeek-V3: are there plans to support mixture-of-experts (MoE) architectures? I could fully understand if this is too far away from a coherent roadmap. (A sketch of the layer type in question follows below.)
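To make the request concrete, here is a minimal sketch of a top-k routed MoE layer of the kind DeepSeek-V3 uses (simplified, without shared experts or load-balancing losses). All class and parameter names here are illustrative, not torchtune APIs.

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    """Token-level top-k routing over a set of expert MLPs (illustrative)."""

    def __init__(self, dim: int, hidden: int, num_experts: int = 8, k: int = 2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(dim, num_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, dim). Pick each token's top-k experts by router score.
        scores = self.router(x)                            # (tokens, num_experts)
        weights, idx = scores.softmax(-1).topk(self.k, dim=-1)
        weights = weights / weights.sum(-1, keepdim=True)  # renormalize over the k chosen
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            # Gather the tokens routed to expert e and mix in its output.
            tok, slot = (idx == e).nonzero(as_tuple=True)
            if tok.numel():
                out[tok] += weights[tok, slot, None] * expert(x[tok])
        return out

moe = TopKMoE(dim=256, hidden=1024)
y = moe(torch.randn(16, 256))  # -> (16, 256)
```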
Docs are broken? Almost all buttons in the "basic" menu lead to 404 errors.
And the return-home button on 404 pages points to "docs.crawl4ai.com", which cannot be found.
@AADaoud Try "docs.crawl4ai.com" again, and make sure to clear your cache.