markitdown
markitdown copied to clipboard
"Description" hardcoded in English when converting images to another language
This is hardcoded: and it should not:
if llm_client is not None and llm_model is not None:
md_content += (
"\n# Description:\n"
+ self._get_llm_description(
local_path,
extension,
llm_client,
llm_model,
prompt=kwargs.get("llm_prompt"),
).strip()
+ "\n"
)
If I am passing a prompt, let ME control if I want a "Description" heading or not, and in what language it should be.
Suggestion: only append it if the prompt is None.
Good point. The library was originally designed to allow LLMs to read various file formats -- and they are inherently multilingual. But, I agree this is not appropriate for other use-cases.