Add get_text_separator parameter to BSHTMLLoader

Open. maver1ck opened this issue 2 years ago • 1 comment

By default, get_text doesn't separate the content of different HTML tags, so text from adjacent elements runs together. Adding an option to specify a separator helps with document splitting.
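
To illustrate the behaviour described above, here is a minimal sketch that calls BeautifulSoup's `get_text` directly, with and without a separator. The sample HTML and variable names are made up for illustration; how BSHTMLLoader passes `get_text_separator` through to `get_text` is assumed here, not shown in this thread.

```python
from bs4 import BeautifulSoup

# Hypothetical sample document, not taken from the PR.
html = "<html><body><h1>Title</h1><p>First paragraph.</p><p>Second paragraph.</p></body></html>"
soup = BeautifulSoup(html, "html.parser")

# Default: the separator is an empty string, so text from adjacent tags is glued together.
joined = soup.get_text()
print(repr(joined))      # 'TitleFirst paragraph.Second paragraph.'

# With a separator, each text node stays distinct, giving a text splitter something to cut on.
separated = soup.get_text("\n")
print(repr(separated))   # 'Title\nFirst paragraph.\nSecond paragraph.'
```

With a newline separator, a downstream splitter can break on element boundaries instead of on arbitrary character counts.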

maver1ck, Apr 25 '23 20:04

@hwchase17 please approve one more time. I fixed the linting issues and added a test.

maver1ck, Apr 26 '23 07:04