ps-analysis-tool icon indicating copy to clipboard operation
ps-analysis-tool copied to clipboard

Expand Coverage Open Cookie DB

Open amedina opened this issue 1 year ago • 8 comments

PSAT uses the Open Cookie Database, which is an open source repository compiling descriptions and categories for major cookies. All cookie descriptions are saved in a downloadable CSV file. All contributions to the CSV file are welcomed.

Description: The purpose of this Feature/Task, is to systematically analyze cookie usage on websites, and determine the use and categorization for cookies which are not currently part of the cookie DB.

Motivation: Increase the classification accuracy of the open cookie DB, and therefore enhance the capabilities of PSAT for providing insight nd guidance to user.

Acceptance Criteria:

  • [ ] The Open Cookie DB is expanded by 20-30% coverage and accuracy
  • [ ] Iterate; this task will span multiple releases

amedina avatar Dec 01 '23 21:12 amedina

31 dec 2023 1308 cookies

26 jan 204 1395 cookies (6.65% increase)

Thank you @milindmore22, @fellyph and @mohdsayed for the additions. Hopefully we can hit the 20% / 30% increase in the near future. I will try to add more cookies in the upcoming weeks.

jkwakman avatar Jan 26 '24 17:01 jkwakman

22 march 2024 1448 cookies (10.7% increase compared to 31 dec 2023)

jkwakman avatar Mar 22 '24 12:03 jkwakman

@gagan0123 @fellyph @milindmore22 Nice to meet you in person!

I have added a Github action for validating the following criteria:

  • CSV structure check
  • Unique UUID check
  • Cookie category check
  • Logging if cookie names are not unique. This is a warning instead of an error because duplicate cookie names do exist.

This should make contributions and maintenance easier.

Kind regards,

Jack Kwakman

jkwakman avatar Jul 06 '24 09:07 jkwakman

6 July 2024 2080 cookies (59% increase compared to 31 dec 2023)

jkwakman avatar Jul 06 '24 09:07 jkwakman

I have added @fellyph as a collaborator to the Open Cookie Database repo. This will help with the database's ongoing maintenance and improvement and long-term health. More collaborators can be added when needed.

jkwakman avatar Jul 08 '24 08:07 jkwakman

@jkwakman I will create a PR now for the Open cookie DB

fellyph avatar Jul 08 '24 08:07 fellyph

@fellyph @milindmore22 @gagan0123

Since the maximum size of a CSV for Github has been reached, I have created a separate HTML file so that you can easily continue searching, see:

https://jkwakman.github.io/Open-Cookie-Database/open-cookie-database.html

jkwakman avatar Jul 14 '24 13:07 jkwakman

@jkwakman This is wonderful 👍🏼 @fellyph can we please include the search application link in the contribution guide?

milindmore22 avatar Jul 15 '24 07:07 milindmore22