papercast
                                
                                 papercast copied to clipboard
                                
                                    papercast copied to clipboard
                            
                            
                            
                        A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Papercast

An extensible pipeline tool and plugin ecosystem for processing technical documents. Written in Python.
Features
| Feature | Examples | 
|---|---|
| Add documents in multiple formats, from popular sources: | PDF LaTeX ArXiv SemanticScholar | 
| Flexible Text Extraction | GROBID More coming soon! Write your own! | 
| Flexible Text Narration | OSX saycommandMore coming soon! Write your own! | 
| Publish to multiple endpoints: | Self-hosted RSS podcast using GitHub Pages Any other endpoint you can think of | 
| Run anywhere: | Local machine Remote server Cloud (AWS, GCP, Azure, etc.) |