alexshmmy
alexshmmy
`llama2` models are fully supported, I just runed the model `llama-2-13b-chat.ggmlv3.q8_0.bin` with the following code in my MAC M1, 64GB run and works great: ```bash from ctransformers import AutoModelForCausalLM llm...
Hi, a similar issue arises when .docx document contain words in automatic numbering format. Docling-SmoDocling fails to convert these words. For example in the attached file: https://www.3gpp.org/ftp/tsg_ran/WG1_RL1/TSGR1_120b/Docs//R1-2501739.zip on page 3,...
@ShiroYasha18, I tried with SmolDocling, the results is same. I do not use any ocrs, just default conversion `docling input.docx` (neither `docling --no-ocr input.docx` worked) Those keywords are lost during...
Thank you @ShiroYasha18 ! It would be great if we can address this, it is million standardisation documents with such automatic numbering issue. @maxmnemonic @PeterStaar-IBM Would it be possible to...
Hi, any news on it? This is very important feature and still unresolved!
@ShiroYasha18 Do you know if this approach can be extended so text in macros that appears in .docx files can be also converted to .md with docling? This will be...
@ShiroYasha18 Thank you for response. I can give an example. In 3gpp standardisation, there are documents with text in macros. For example this one: https://www.3gpp.org/ftp/tsg_ran/WG1_RL1/TSGR1_120b/Docs//R1-2501739.zip you can see the words...
@Zilong01 Not sure if I understood your question. Tables are properly parsed and saved to .md file, irrespectively if you chose to save also the tables as .png or not....
@Zilong01 Alright! I understood. Since there is way to save the tables as an images, in the row they appear in the text, wouldn't be fairly easy with a simple...