ai4science topic
PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
UniDL4BioPep
webserver
ChemFlow
Uncover meaningful structures of latent spaces learned by generative models with flows!
TBSI-Sunwoda-Battery-Dataset
Sunwoda Electronic Co., Ltd. and Tsinghua-Berkeley Shenzhen Institute (TBSI) generated the TBSI Sunwoda Battery Dataset. We open-source this dataset to inspire more data-driven novel material verificat...
MinerU
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
llamp
A web app and Python API for a multi-modal RAG framework that grounds LLMs in high-fidelity materials informatics. An agentic materials scientist powered by @materialsproject, @langchain-ai, and @openai
LLM-SR
This is the official repo for the paper "LLM-SR" on Scientific Equation Discovery and Symbolic Regression with LLMs