dedoc
pdf broken encoding reader
Reader to extract text from PDFs with a complex background, using information from the PDF itself.
Added the code from my project that is needed to extract text:
- reader
- config
- h5 models
- text post processing
- fontforge wrapper
- example pdf
Added a script to the scripts directory that extracts text using the reader.
Edited index.html, api_args, manager_config, gitignore, and requirements.
Please look at the logs of the test pipelines; they should all pass before merge.
There is an import problem with numpy when running the tests: the test run installs numpy 2.0.2, but numpy<2.0 is required. I also ran the tests on dedoc without my commits, on the development branch, and the problem remains. Please check.
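To help reproduce the numpy issue, here is a minimal sketch of a check that could be run inside the test environment. The helper names `satisfies_lt_2` and `installed_numpy_ok` are hypothetical and not part of dedoc; the check only compares the major version number, which is a simplification of the full `numpy<2.0` specifier.

```python
from importlib import metadata


def satisfies_lt_2(version: str) -> bool:
    """Return True if the major version number is below 2 (hypothetical helper)."""
    return int(version.split(".")[0]) < 2


def installed_numpy_ok() -> bool:
    """Check the numpy version installed in the current environment, if any."""
    try:
        return satisfies_lt_2(metadata.version("numpy"))
    except metadata.PackageNotFoundError:
        # numpy is not installed at all, so the requirement is not met
        return False


print(satisfies_lt_2("1.26.4"))  # True
print(satisfies_lt_2("2.0.2"))   # False, the version the test run installs
```

Calling `installed_numpy_ok()` in the pipeline environment should show whether the installed numpy violates the constraint before any dedoc import is attempted.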