Abhishek
Abhishek
We can close the PR #38709 if we decide to strip this feature completely.
cc @dstandish
I think we can remove it cause anyway it is unreachable code. So we can be sure no one is using it.
> It'd be possible for a user to swap in TaskHandlerWithCustomFormatter in `airflow_local_settings`, right? Nah, it's inheriting StreamHandler which is not at all used for Task logs. Task logs are...
I think this error exists with `UnstructuredHTMLLoader` and `UnstructuredMarkdownLoader` both.
https://github.com/hwchase17/langchain/issues/5264
I created a markdownloader copying TextLoader. It uses marko for converting into html and then BeautifulSoup to extract text. Seems to be working for me. https://github.com/abhishekbhakat/privateGPT/tree/main
I changed the `file_data.read_text()` to: ```python return string_reader.load_data([file_data.read_text("utf-8", errors="replace").replace("\x00", "\uFFFD")]) ``` File "/Users/sk/privateGPT/private_gpt/components/ingest/ingest_helper.py", in _load_file_to_documents
Yeah sure. Renaming now. Will PR this later this week.