videocr-PaddleOCR
Extract hardcoded subtitles from videos using machine learning
Hi, I'm running the script from the demo: from videocr import get_subtitles if __name__ == '__main__': # This check is mandatory for Windows. print(get_subtitles( video_path='2.mp4', lang='ch', time_start='0:00', time_end='', conf_threshold=75, sim_threshold=80, use_fullframe=False, det_model_dir=None, rec_model_dir=None, use_gpu=False, brightness_threshold=None, similar_image_threshold=100, similar_pixel_threshold=25,...
The algorithm currently merges 2 subtitles by picking the one with the highest average confidence. Ideally, all characters with high confidence should be used if there is no conflict in...
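To make the difference concrete, here is a minimal sketch of both strategies. It is not the project's actual code: the `Candidate` class, its `char_confidences` field, and both function names are hypothetical, and the per-character variant is only one possible reading of the idea (it assumes the two readings have the same length and falls back to the average-based pick otherwise).

```python
from dataclasses import dataclass
from statistics import mean
from typing import List


@dataclass
class Candidate:
    """Hypothetical container for one OCR reading of a subtitle."""
    text: str
    char_confidences: List[float]  # assumed: one score per character

    @property
    def average_confidence(self) -> float:
        return mean(self.char_confidences) if self.char_confidences else 0.0


def merge_by_average(a: Candidate, b: Candidate) -> Candidate:
    """Strategy described above: keep the reading with the higher average."""
    return a if a.average_confidence >= b.average_confidence else b


def merge_per_character(a: Candidate, b: Candidate) -> Candidate:
    """Assumed alternative: pick the more confident character at each position."""
    if len(a.text) != len(b.text):
        # Positions cannot be aligned one-to-one, so fall back.
        return merge_by_average(a, b)
    chars, confs = [], []
    for ch_a, p_a, ch_b, p_b in zip(
        a.text, a.char_confidences, b.text, b.char_confidences
    ):
        if p_a >= p_b:
            chars.append(ch_a)
            confs.append(p_a)
        else:
            chars.append(ch_b)
            confs.append(p_b)
    return Candidate("".join(chars), confs)


first = Candidate("hel1o", [0.9, 0.9, 0.9, 0.3, 0.9])
second = Candidate("hxllo", [0.9, 0.2, 0.9, 0.95, 0.9])
print(merge_by_average(first, second).text)
print(merge_per_character(first, second).text)
```

In this made-up example each reading contains one low-confidence mistake, so the average-based pick keeps an error while the per-character pick does not. Readings of different lengths would need a proper alignment step, which this sketch does not attempt.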
`from videocr import save_subtitles_to_file #@title OCR parameters input_file_path = "example_cropped.mp4" #@param {type:"string"} output_file_path = "example.srt" #@param {type:"string"} language_code = "ch" #@param {type:"string"} use_gpu = False #@param {type:"boolean"} start_time = "00:00"...
Unable to seek to specific frames, and the frame count returned is not accurate. The issue appeared with a video that uses VP9.
If so, then PaddleOCR only needs to be run on one frame per subtitle.
Hi, I've wanted to use your program with docker on Linux but I had the following problems: ### Issue 1: Following PaddleOCR environment setup instructions I used the `paddlepaddle/paddle:2.1.3-gpu-cuda10.2-cudnn7` image,...
error: OpenCV(4.6.0) /io/opencv/modules/imgproc/src/color.cpp:182: error: (-215:Assertion failed) !_src.empty() in function 'cvtColor' This error happens when I click OCR parameters. What should I do?
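The `!_src.empty()` assertion in `cvtColor` generally means an empty image was passed in, which often happens when the input video could not be read. As a first check, and only a sketch under that assumption, the path can be validated before any OCR call. The helper name `check_video_path` is hypothetical and is not part of videocr.

```python
from pathlib import Path


def check_video_path(video_path: str) -> Path:
    """Hypothetical pre-flight check: fail early with a readable message."""
    path = Path(video_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"Video file not found: {path}")
    if path.stat().st_size == 0:
        raise ValueError(f"Video file is empty: {path}")
    return path


if __name__ == "__main__":
    try:
        # Placeholder file name for illustration; use your own input path.
        print(check_video_path("example_cropped.mp4"))
    except (FileNotFoundError, ValueError) as exc:
        print(f"Input problem: {exc}")
```

If the path is valid and the error persists, the cause is likely elsewhere (for example a codec that the installed OpenCV build cannot decode), which this check cannot detect.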