rag-gpt
Optimize the strategy for extracting text from HTML webpages by removing unnecessary and distracting information
Remove all ['nav', 'footer', 'aside', 'script', 'style'] tags, which carry no meaningful content for text extraction #61
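
The tag filtering described above can be sketched roughly as follows. This is a minimal illustration using only Python's standard-library `html.parser`, not the project's actual implementation (which may well use a different parser). The names `SKIP_TAGS`, `_TextExtractor`, and `extract_main_text` are hypothetical and chosen for this example only.

```python
from html.parser import HTMLParser

# Tags whose contents are dropped, taken from the change description.
SKIP_TAGS = {"nav", "footer", "aside", "script", "style"}


class _TextExtractor(HTMLParser):
    """Collects text, ignoring anything nested inside a SKIP_TAGS element."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside at least one skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside every skipped element.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def extract_main_text(html):
    """Hypothetical helper: return page text with skipped tags removed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return parser.text()
```

For example, `extract_main_text("<nav>Home</nav><p>Hello</p>")` keeps only the paragraph text. A depth counter is used rather than a boolean so that skipped elements nested inside one another (an `aside` inside a `footer`, say) are handled correctly.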