database icon indicating copy to clipboard operation
database copied to clipboard

Abandoned games at the end of the month

Open tmmlaarhoven opened this issue 3 years ago • 0 comments

(migrated from https://github.com/ornicar/lila/issues/9720)

Regarding the databases on https://database.lichess.org/, most files for different months were generated long after the months were over, which meant that abandoned games had long been removed from the server.

Starting from the recent July 2021 database however, the PGN actually contains games which were cancelled/abandoned near the end of the month.

One such example game in the July 2021 database is given below, which was played/abandoned on July 30th. Note that the corresponding link https://lichess.org/zPDH02kW is now long gone, but presumably when the static export was generated the game still existed on the server.

[Event "Rated Correspondence game"]
[Site "https://lichess.org/zPDH02kW"]
[Date "2021.07.30"]
[Round "-"]
[White "Minty0209"]
[Black "Amini_alireza"]
[Result "*"]
[UTCDate "2021.07.30"]
[UTCTime "02:18:45"]
[WhiteElo "1500"]
[BlackElo "1500"]
[WhiteTitle "WFM"]
[ECO "?"]
[Opening "?"]
[TimeControl "-"]
[Termination "Abandoned"]

*

To fix the database and exclude such games (as was also done for all months prior to July 2021), maybe one could generate the static database later, when the aborted games have been removed from the server, or the export could manually filter out these aborted games.

See also the corresponding Zulip discussion.

tmmlaarhoven avatar Sep 04 '21 23:09 tmmlaarhoven