ccextractor
ccextractor copied to clipboard
[QUESTION]How to get Teletext pages with possible subtitles without actual extraction
CCExtractor version (using the --version parameter preferably) : 0.87
In raising this issue, I confirm the following:
- [X] I have read and understood the contributors guide.
- [X] I have checked that the bug-fix I am reporting can be replicated, or that the feature I am suggesting isn't already present.
- [X] I have checked that the issue I'm posting isn't already reported.
- [X] I have checked that the issue I'm porting isn't already solved and no duplicates exist in closed issues and in opened issues
- [X] I have checked the pull requests tab for existing solutions/implementations to my issue/suggestion.
- [X] I have used the latest available version of CCExtractor to verify this issue exists.
My familiarity with the project is as follows:
- [X] I am an active contributor to CCExtractor.
Necessary information
- Is this a regression (did it work before)? [X] NO
- What platform did you use? [ ] Windows - [ ] Linux - [X] Mac
- What were the used arguments?
-out=report
Additional information
When running ccextractor
against a video file, I get an output e.g. like this:
Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889
I would like to get this information without ccextractor
automatically extracting the subtitles because I need to specify the actual page to extract from based on service name and desired language.
I thought the option -out=report
would achieve this, but for the same video file I get the output
//////// Program #10302: ////////
DVB Subtitles: No
Teletext: Yes
Pages With Subtitles:
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No
MPEG-4 Timed Text: No
Is this a bug/limitation in the report
parameter or is there a different way to achieve this?
@workflowsguy you can easily extract captions to console only with -stdout parameter.
Note that ccextractor will ignore -o, -o1 and -o2 in this case.
So the most simple way to use this parameter is:
input_file -stdout
You can read more about this option(and other ccextractor available parameters) here:
https://ccextractor.org/public:general:command_line_usage
@MakarovGCI2018 sorry, but I do not understand how your answer relates to my question.
Using -stdout
still causes ccextractor
to parse the video file completely which is not what I want.
@workflowsguy sorry for misunderstanding. So you just want to get info about teletext pages with possible subtitles without actual processing it using -out=report, right?
@navimakarov, sorry for the long delay in replying. Yes, I need some way to get information if the video file contains teletext pages with possible subtitles without actually starting the extraction process.
Hey guys, I am new to open source. I would like to work on this issue if it is still open. Please guide me how to get started and work on this issue.
@neilmehta31 By all means go ahead. Usually you want to start by reproducing it.
Hey @cfsmp3 , I got the following output while running against one of the sample recording files given on the website
Stream Mode: Transport Stream
Program Count: 1
Program Numbers: 4287
PID: 201, Program: 4287, MPEG-2 video
PID: 202, Program: 4287, MPEG-1 audio
PID: 205, Program: 4287, DVB Subtitles
PID: 206, Program: 4287, MPEG-1 audio
PID: 250, Program: 4287, MPEG-2 private table sections
PID: 7201, Program: 4287, ISO/IEC 13818-6 type B
PID: 7219, Program: 4287, ISO/IEC 13818-6 type B
PID: 7270, Program: 4287, MPEG-2 private table sections
//////// Program #4287: ////////
DVB Subtitles: Yes
Teletext: No
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No
MPEG-4 Timed Text: No
If its the wrong file could you please provide a link to that file to reproduce the issue. Thanks
When running
ccextractor
against a video file, I get an output e.g. like this:Notice: Teletext page with possible subtitles detected: 152 - No teletext page specified, first received suitable page is 152, not guaranteed Notice: Teletext page with possible subtitles detected: 888 Notice: Teletext page with possible subtitles detected: 151 Notice: Teletext page with possible subtitles detected: 150 Notice: Teletext page with possible subtitles detected: 889
@workflowsguy can you provide the video sample used? I am unable to reproduce this.
I ran ccextractor path_to_file
for the video file (provided in ccextractor's tv samples page) containing teletex subtitles. I got the following results.
Opening file: linux/tests/teletex_test.mpg
File seems to be a transport stream, enabling TS mode
Analyzing data in general mode
VBI/teletext stream ID 1044 (0x414) for SID 1040 (0x410)
- Programme Identification Data = FAB-TELETEXT SYSTEM
- Universal Time Co-ordinated = Mon Jan 9 18:27:47 2017
100% | 24:51
Teletext decoder: 37286 packets processed
Number of NAL_type_7: 0
Number of VCL_HRD: 0
Number of NAL HRD: 0
Number of jump-in-frames: 0
Number of num_unexpected_sei_length: 0
Min PTS: 04:21:47:620
Max PTS: 04:46:39:020
Length: 00:24:51:400
Done, processing time = 3 seconds
There is no information like Notice: Teletext page with possible subtitles detected: 152
I would love to work on this issue, I am a beginner, can you please guide me.
Hey I want to work on this issue @workflowsguy. please tell how to get started I am a beginner
@vaishnavi192, I am the wrong person to ask for guidance. I asked this question here 5 years ago and it has not been answered/adressed to my satisfaction since then. I have long since moved on.