Duplicate programme entries
Describe the issue For some programmes there seems to be 2 entries. I've noticed this on channels including BBC ONE HD and 4Music - have not looked much further yet.
An example is that the following entry is repeated
Sometimes the entry and its duplicate are consecutive in the output. But I've also seen a block of duplicates following their originals. Ie. Prog1, Prog2, Prog3, Prog4, Prog1, Prog2, Prog3, Prog4
Expected behavior Single entry for each programme
Screenshots N/A.
Device information Windows 10 PC; Chrome browser, saving to a file for examination with Notepad
Additional context Search for instances of "All The Hits, All Night" in the attached file (saved directly from Chrome, then edited down to make the file a sensible size). epg.txt
Oh, apologies, the XML I pasted into the issue description has been rendered so you can't see the detail (ie. same start and stop dates/times for the two entries). Best to look in the attachment as per the Additional context.
Cheers, Dan
Looks like the duplicates are last in one day's data and also first in the next day's - since there's no checking for duplicates it just yeets them into the programme list.
Have a look at the data for BBCOneLondonHD from the last programme on the 27th ("Greta") through the next several hours in the attached (different) extract. All seems fine for the following 20 or so entries up to start=20240428155000, but then it goes back and repeats everything from the last programme on the 27th, so that whole block of 20 or so entries is repeated.
Is this is a feature of the source from which you get the data?
For my use, I can deal with the dups easily enough - just have my fetch function check for a duplicate and, if so, don't include it in the output. Would be nice to have this done at source or in your code but not important.
Cheers, Dan
Is this is a feature of the source from which you get the data?
Probably not, most channels are sourced from Sky or Freeview
Would be nice to have this done at source or in your code but not important.
I'll probably get round to it at some point, though I don't give timescales since this is a hobby. PRs are welcome though 😊