olmocr icon indicating copy to clipboard operation
olmocr copied to clipboard

Toolkit for linearizing PDFs for LLM datasets/training

Results 61 olmocr issues
Sort by recently updated
recently updated
newest added

### 🚀 The feature, motivation and pitch here is how to launch a api server using sglang for it quick start page Launch A Server from sglang.test.test_utils import is_in_ci from...

### 🚀 The feature, motivation and pitch Currently Nvidia HW is required - support for macOS will be really useful. ### Alternatives _No response_ ### Additional context _No response_

Dear allenai Team, Congrats on this beautiful tool! I wanted to express my admiration for the incredible work you've done here. I’m grateful for all that you’ve accomplished so far....

### 🐛 Describe the bug Component: Pipeline (pipeline.py) Version: Latest as of February 27, 2025 (assumed from git clone) Environment: OS: Ubuntu Python: 3.12 GPU: NVIDIA RTX 3090 (24 GB...

bug

### 🐛 Describe the bug python -m olmocr.pipeline ./localworkspace --pdfs tests/gnarly_pdfs/horribleocr.pdf INFO:olmocr.check:pdftoppm is installed and working. 2025-02-27 15:16:34,235 - __main__ - INFO - Got --pdfs argument, going to add to...

bug

### 🐛 Describe the bug ```bash (olmocr) ubuntu@xxx-xxx-xxx-xxx:~$ python -m olmocr.pipeline ./localworkspace --pdfs paper.pdf INFO:olmocr.check:pdftoppm is installed and working. 2025-02-27 10:17:40,491 - __main__ - INFO - Got --pdfs argument, going...

bug

公式无法正确识别,返回的结果是错误的 这个公式识别结果为 $$\begin{array}{l} u_1(x, y, z) \ u_2(x, y, z) \ u_3(x, y, z) \end{array}] [= \sum_{k_1, k_2, k_3} e^{i\frac{xk_1 + yk_2 + zk_3}{l}} \begin{bmatrix} C(k_1, k_2, k_3) \ n_1(k_1,...

### 🐛 Describe the bug When I do ``` python -m olmocr.pipeline s3://my-bucket/workspace --pdfs s3://my-bucket/inputs/*.pdf ``` any PDF files containing a comma in their name won't get processed. Say for...

bug

请问是否支持docker 部署 和 本地详细的部署文档。谢谢!

### 🚀 The feature, motivation and pitch Thank you for releasing an amazing work. I think in traditional OCR tools, metadata of text like coordinate, font, size, color, style, etc....