[FEAT] Capturing the captions of figures and tables

Open Maikiii44 opened this issue 2 months ago • 2 comments

💡 Cap

It would be helpful for the model to identify images in a document or scientific paper and determine the corresponding caption for each. The output could be in JSON.
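As a rough illustration of the requested output, a minimal sketch might pair each figure or table with the caption block that directly follows it and emit the result as JSON. The block list, its `type`/`text`/`id` keys, and the `pair_captions` helper are all hypothetical; they are not the API or output format of any existing tool, and a real document may place captions above the figure or further away.

```python
import json

# Hypothetical blocks in reading order. The keys used here are assumptions
# made for illustration, not the output of any real parser.
blocks = [
    {"type": "Text", "text": "We evaluate on three datasets."},
    {"type": "Figure", "id": "fig-1"},
    {"type": "Caption", "text": "Figure 1: Accuracy per dataset."},
    {"type": "Table", "id": "tbl-1"},
    {"type": "Caption", "text": "Table 1: Dataset statistics."},
    {"type": "Figure", "id": "fig-2"},  # no caption follows this one
]


def pair_captions(blocks):
    """Attach to each figure or table the caption block directly after it."""
    results = []
    for i, block in enumerate(blocks):
        if block["type"] not in ("Figure", "Table"):
            continue
        # Look at the next block only; anything else means "no caption found".
        nxt = blocks[i + 1] if i + 1 < len(blocks) else None
        caption = nxt["text"] if nxt and nxt["type"] == "Caption" else None
        results.append(
            {"id": block["id"], "kind": block["type"].lower(), "caption": caption}
        )
    return results


captions = pair_captions(blocks)
print(json.dumps(captions, indent=2))
```

Items without an adjacent caption get `"caption": null` in the JSON, so a consumer can tell "no caption detected" apart from an empty caption.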

Nov 04 '25 08:11 Maikiii44

Are you talking about this? https://github.com/datalab-to/marker/issues/838

Nov 04 '25 08:11 kipavy

Could you please provide example code that uses an LLM?

Nov 08 '25 12:11 ankit8347