Max Ostapenko

Results 18 issues of Max Ostapenko

Progress on https://github.com/HTTPArchive/almanac.httparchive.org/issues/2891 based on the [chapter outline](https://docs.google.com/document/d/1r1TU1rH3v-686t7EaqQQ0mXg7unNKZlFYmUmbscQRtM/edit) ## How websites track (profile) you online ### Third-party tracking using [WhoTracks.me](https://github.com/ghostery/whotracks.me) - [x] number of websites that have a third-party tracker...

analysis

Applying a key sorting to parsed discovery documents before writing to cache files. Closes #411

[Analysis plan details](https://docs.google.com/document/d/1WJT9kfKHxwNl5HNAhIddefWC0vCvH3DUIxajT1gQWao/edit) ### Queries Bounce tracking: - [x] number_of_websites_with_bounce_tracking.sql CNAME - [x] most_common_cname_domains.sql IAB consent frameworks: - [x] most_common_countries_for_iab_tcf_v2.sql - [x] most_common_referrer_policy.sql - [x] most_common_strings_for_iab_usp.sql - [x] number_of_websites_with_iab.sql GPC...

analysis

- [x] metadata upload script to `wappalyzer.apps` table - [x] icons upload to `gs://wappalizer-icons/` @rviscomi @tunetheweb upon some ideas of reusing technology data further I thought we could sync the...

@rviscomi @pmeenan @tunetheweb to wrap the topic of maintenance efforts... Is it any helpful idea? **Is your feature request related to a problem? Please describe.** Currently the list of technologies...

- [x] removed GPC client-side detection, doesn't make sense in lab tests. Use `.well-known/gpc.json` instead - [x] removed DNT client-side detection, doesn't make sense in lab tests - [x] removed...

I started working on query migration guide, and found that this repo and har.fyi have a few duplicated docs. As har.fyi is a bit better looking than Github files, here...

### Search before asking - [X] I searched the [issues](https://github.com/sqlfluff/sqlfluff/issues) and found no similar issues. ### What Happened CTE closing bracket is on the same line and `layout.cte_bracket` is excluded....

bug

I was editing some wiki content recently and felt like I miss a PR review experience. It would be also great to have wiki updated in the same PR with...

project management

During the crawl no technology detected for 2% out of top 10000 pages. Running detection on my laptop is successful. ```sql SELECT page, rank FROM `httparchive.all.pages` WHERE date = "2024-09-01"...

bug