olmocr issues

sglang or vllm api interface

1

### 🚀 The feature, motivation and pitch here is how to launch a api server using sglang for it quick start page Launch A Server from sglang.test.test_utils import is_in_ci from...

devops724

macOS support

16

### 🚀 The feature, motivation and pitch Currently Nvidia HW is required - support for macOS will be really useful. ### Alternatives _No response_ ### Additional context _No response_

amitkot

Request for Apple Silicon Support.

Dear allenai Team, Congrats on this beautiful tool! I wanted to express my admiration for the incredible work you've done here. I’m grateful for all that you’ve accomplished so far....

LeaveNhA

CUDA-ooM with large PDFs

5

### 🐛 Describe the bug Component: Pipeline (pipeline.py) Version: Latest as of February 27, 2025 (assumed from git clone) Environment: OS: Ubuntu Python: 3.12 GPU: NVIDIA RTX 3090 (24 GB...

infinitytrans

bug

### 🐛 Describe the bug python -m olmocr.pipeline ./localworkspace --pdfs tests/gnarly_pdfs/horribleocr.pdf INFO:olmocr.check:pdftoppm is installed and working. 2025-02-27 15:16:34,235 - __main__ - INFO - Got --pdfs argument, going to add to...

limuan123123

bug

SGlang does not meet expectations.(sqlite3.OperationalError: no such column: "size" - should this be a string literal in single-quotes?) A10

4

### 🐛 Describe the bug ```bash (olmocr) ubuntu@xxx-xxx-xxx-xxx:~$ python -m olmocr.pipeline ./localworkspace --pdfs paper.pdf INFO:olmocr.check:pdftoppm is installed and working. 2025-02-27 10:17:40,491 - __main__ - INFO - Got --pdfs argument, going...

Mashiro2000

bug

公式识别错误

公式无法正确识别，返回的结果是错误的这个公式识别结果为 $$\begin{array}{l} u_1(x, y, z) \ u_2(x, y, z) \ u_3(x, y, z) \end{array}] [= \sum_{k_1, k_2, k_3} e^{i\frac{xk_1 + yk_2 + zk_3}{l}} \begin{bmatrix} C(k_1, k_2, k_3) \ n_1(k_1,...

gaoming1227

Fails to process S3 files with a comma in the name

### 🐛 Describe the bug When I do ``` python -m olmocr.pipeline s3://my-bucket/workspace --pdfs s3://my-bucket/inputs/*.pdf ``` any PDF files containing a comma in their name won't get processed. Say for...

Dennitz

bug

请问是否支持docker 部署和本地详细的部署文档。谢谢！

4

请问是否支持docker 部署和本地详细的部署文档。谢谢！

limuan123123

Text metadata

3

### 🚀 The feature, motivation and pitch Thank you for releasing an amazing work. I think in traditional OCR tools, metadata of text like coordinate, font, size, color, style, etc....

joey234

olmocr
olmocr copied to clipboard

Metadata

sglang or vllm api interface

macOS support

Request for Apple Silicon Support.

CUDA-ooM with large PDFs

启动单个pdf报 502的错误。

SGlang does not meet expectations.(sqlite3.OperationalError: no such column: "size" - should this be a string literal in single-quotes?) A10

公式识别错误

Fails to process S3 files with a comma in the name

请问是否支持docker 部署和本地详细的部署文档。谢谢！

Text metadata

← Metadata

Owner

Metadata

olmocr olmocr copied to clipboard

Metadata

← Metadata

Owner

Metadata

olmocr
olmocr copied to clipboard