alexshmmy

Results 9 comments of alexshmmy

`llama2` models are fully supported, I just runed the model `llama-2-13b-chat.ggmlv3.q8_0.bin` with the following code in my MAC M1, 64GB run and works great: ```bash from ctransformers import AutoModelForCausalLM llm...

Hi, a similar issue arises when .docx document contain words in automatic numbering format. Docling-SmoDocling fails to convert these words. For example in the attached file: https://www.3gpp.org/ftp/tsg_ran/WG1_RL1/TSGR1_120b/Docs//R1-2501739.zip on page 3,...

@ShiroYasha18, I tried with SmolDocling, the results is same. I do not use any ocrs, just default conversion `docling input.docx` (neither `docling --no-ocr input.docx` worked) Those keywords are lost during...

Thank you @ShiroYasha18 ! It would be great if we can address this, it is million standardisation documents with such automatic numbering issue. @maxmnemonic @PeterStaar-IBM Would it be possible to...

Hi, any news on it? This is very important feature and still unresolved!

@ShiroYasha18 Do you know if this approach can be extended so text in macros that appears in .docx files can be also converted to .md with docling? This will be...

@ShiroYasha18 Thank you for response. I can give an example. In 3gpp standardisation, there are documents with text in macros. For example this one: https://www.3gpp.org/ftp/tsg_ran/WG1_RL1/TSGR1_120b/Docs//R1-2501739.zip you can see the words...

@Zilong01 Not sure if I understood your question. Tables are properly parsed and saved to .md file, irrespectively if you chose to save also the tables as .png or not....

@Zilong01 Alright! I understood. Since there is way to save the tables as an images, in the row they appear in the text, wouldn't be fairly easy with a simple...