ccextractor icon indicating copy to clipboard operation
ccextractor copied to clipboard

[QUESTION]How to get Teletext pages with possible subtitles without actual extraction

Open workflowsguy opened this issue 5 years ago • 11 comments

CCExtractor version (using the --version parameter preferably) : 0.87

In raising this issue, I confirm the following:

  • [X] I have read and understood the contributors guide.
  • [X] I have checked that the bug-fix I am reporting can be replicated, or that the feature I am suggesting isn't already present.
  • [X] I have checked that the issue I'm posting isn't already reported.
  • [X] I have checked that the issue I'm porting isn't already solved and no duplicates exist in closed issues and in opened issues
  • [X] I have checked the pull requests tab for existing solutions/implementations to my issue/suggestion.
  • [X] I have used the latest available version of CCExtractor to verify this issue exists.

My familiarity with the project is as follows:

  • [X] I am an active contributor to CCExtractor.

Necessary information

  • Is this a regression (did it work before)? [X] NO
  • What platform did you use? [ ] Windows - [ ] Linux - [X] Mac
  • What were the used arguments? -out=report

Additional information

When running ccextractor against a video file, I get an output e.g. like this:

Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889

I would like to get this information without ccextractor automatically extracting the subtitles because I need to specify the actual page to extract from based on service name and desired language. I thought the option -out=report would achieve this, but for the same video file I get the output

//////// Program #10302: ////////
DVB Subtitles: No
Teletext: Yes
Pages With Subtitles: 
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No

MPEG-4 Timed Text: No

Is this a bug/limitation in the report parameter or is there a different way to achieve this?

workflowsguy avatar Nov 01 '18 15:11 workflowsguy

@workflowsguy you can easily extract captions to console only with -stdout parameter. Note that ccextractor will ignore -o, -o1 and -o2 in this case. So the most simple way to use this parameter is: input_file -stdout You can read more about this option(and other ccextractor available parameters) here: https://ccextractor.org/public:general:command_line_usage

navimakarov avatar Dec 14 '18 19:12 navimakarov

@MakarovGCI2018 sorry, but I do not understand how your answer relates to my question. Using -stdout still causes ccextractor to parse the video file completely which is not what I want.

workflowsguy avatar Dec 18 '18 17:12 workflowsguy

@workflowsguy sorry for misunderstanding. So you just want to get info about teletext pages with possible subtitles without actual processing it using -out=report, right?

navimakarov avatar Dec 18 '18 20:12 navimakarov

@navimakarov, sorry for the long delay in replying. Yes, I need some way to get information if the video file contains teletext pages with possible subtitles without actually starting the extraction process.

workflowsguy avatar Feb 19 '19 09:02 workflowsguy

Hey guys, I am new to open source. I would like to work on this issue if it is still open. Please guide me how to get started and work on this issue.

neilmehta31 avatar Jan 06 '21 10:01 neilmehta31

@neilmehta31 By all means go ahead. Usually you want to start by reproducing it.

cfsmp3 avatar Jan 09 '21 18:01 cfsmp3

Hey @cfsmp3 , I got the following output while running against one of the sample recording files given on the website

Stream Mode: Transport Stream
Program Count: 1
Program Numbers: 4287 
PID: 201, Program: 4287, MPEG-2 video
PID: 202, Program: 4287, MPEG-1 audio
PID: 205, Program: 4287, DVB Subtitles
PID: 206, Program: 4287, MPEG-1 audio
PID: 250, Program: 4287, MPEG-2 private table sections
PID: 7201, Program: 4287, ISO/IEC 13818-6 type B
PID: 7219, Program: 4287, ISO/IEC 13818-6 type B
PID: 7270, Program: 4287, MPEG-2 private table sections
//////// Program #4287: ////////
DVB Subtitles: Yes
Teletext: No
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No

MPEG-4 Timed Text: No

If its the wrong file could you please provide a link to that file to reproduce the issue. Thanks

neilmehta31 avatar Jan 10 '21 06:01 neilmehta31

When running ccextractor against a video file, I get an output e.g. like this:

Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889

@workflowsguy can you provide the video sample used? I am unable to reproduce this.

I ran ccextractor path_to_file for the video file (provided in ccextractor's tv samples page) containing teletex subtitles. I got the following results.

Opening file: linux/tests/teletex_test.mpg
File seems to be a transport stream, enabling TS mode
Analyzing data in general mode
VBI/teletext stream ID 1044 (0x414) for SID 1040 (0x410)
- Programme Identification Data = FAB-TELETEXT SYSTEM 
- Universal Time Co-ordinated = Mon Jan  9 18:27:47 2017
100%  |  24:51
Teletext decoder: 37286 packets processed 

Number of NAL_type_7: 0
Number of VCL_HRD: 0
Number of NAL HRD: 0
Number of jump-in-frames: 0
Number of num_unexpected_sei_length: 0

Min PTS:				04:21:47:620
Max PTS:				04:46:39:020
Length:				 00:24:51:400
Done, processing time = 3 seconds

There is no information like Notice: Teletext page with possible subtitles detected: 152

siv2r avatar Jan 28 '21 04:01 siv2r

I would love to work on this issue, I am a beginner, can you please guide me.

85ayush avatar Feb 23 '21 11:02 85ayush

Hey I want to work on this issue @workflowsguy. please tell how to get started I am a beginner

vaishnavi192 avatar Nov 15 '23 14:11 vaishnavi192

@vaishnavi192, I am the wrong person to ask for guidance. I asked this question here 5 years ago and it has not been answered/adressed to my satisfaction since then. I have long since moved on.

workflowsguy avatar Nov 17 '23 11:11 workflowsguy