llama_index
                                
                                
                                
                                    llama_index copied to clipboard
                            
                            
                            
                        Skip when trafilatura extraction failed
Sometimes, item in the urls will be fail to fetch or parse. Especially when we use some bulk url generator like when we use serpapi. Instead failing trafilatura reader, we should skip this item and not add the result into document.
thanks! could we add a param
error_on_missingin the__init__that defaults to False? that way if users want, we can still explicitly fail on None (raise ValueError)
Good Idea, will add that.