almanac.httparchive.org icon indicating copy to clipboard operation
almanac.httparchive.org copied to clipboard

HTTP 2022

Open rviscomi opened this issue 2 years ago β€’ 22 comments

HTTP 2022

HTTP illustration

If you're interested in contributing to the HTTP chapter of the 2022 Web Almanac, please reply to this issue and indicate which role or roles best fit your interest and availability: author, reviewer, analyst, and/or editor.

Content team

Lead Authors Reviewers Analysts Editors Coordinator
@paivaspol @paivaspol @tunetheweb @rmarx @LPardue @paivaspol @paulcalvano
Expand for more information about each role πŸ‘€
  • The content team lead is the chapter owner and responsible for setting the scope of the chapter and managing contributors' day-to-day progress.
  • Authors are subject matter experts and lead the content direction for each chapter. Chapters typically have one or two authors. Authors are responsible for planning the outline of the chapter, analyzing stats and trends, and writing the annual report.
  • Reviewers are also subject matter experts and assist authors with technical reviews during the planning, analyzing, and writing phases.
  • Analysts are responsible for researching the stats and trends used throughout the Almanac. Analysts work closely with authors and reviewers during the planning phase to give direction on the types of stats that are possible from the dataset, and during the analyzing/writing phases to ensure that the stats are used correctly.
  • Editors are technical writers who have a penchant for both technical and non-technical content correctness. Editors have a mastery of the English language and work closely with authors to help wordsmith content and ensure that everything fits together as a cohesive unit.
  • The section coordinator is the overall owner for all chapters within a section like "User Experience" or "Page Content" and helps to keep each chapter on schedule.

Note: The time commitment for each role varies by the chapter's scope and complexity as well as the number of contributors.

For an overview of how the roles work together at each phase of the project, see the Chapter Lifecycle doc.

Milestone checklist

0. Form the content team

  • [x] May 1: The content team has at least one author, reviewer, and analyst

1. Plan content

  • [x] May 15 The content team has completed the chapter outline in the draft doc

2. Gather data

  • [x] June 1: Analysts have added all necessary custom metrics and drafted a PR (example) to track query progress
  • June 1 - 15: HTTP Archive runs the June crawl

3. Validate results

  • [ ] August 1: Analysts have queried all metrics and saved the output to the results sheet

4. Draft content

  • [ ] September 1: The content team has written, reviewed, and edited the chapter in the doc

5. Publication

  • [ ] September 15: The completed chapter and all required metadata and figures are converted to markdown and submitted to GitHub
  • September 26: Target launch date πŸš€

Chapter resources

Refer to these 2022 HTTP resources throughout the content creation process:

πŸ“„ Google Docs for outlining and drafting content πŸ” SQL files for committing the queries used during analysis πŸ“Š Google Sheets for saving the results of queries πŸ“ Markdown file for publishing content and managing public metadata πŸ’¬ #web-almanac-http on Slack for team coordination

rviscomi avatar Apr 12 '22 17:04 rviscomi

I can help being a analyst and/or reviewer for this chapter.

paivaspol avatar Apr 16 '22 10:04 paivaspol

Happy to review too and, as last year’s analyst, happy to help this years analyst with any questions or queries.

tunetheweb avatar Apr 16 '22 10:04 tunetheweb

As always I'm interested in contributing here, but I won't know before May 1 (or even May 15) if I can be an author or just a reviewer this year...

rmarx avatar Apr 20 '22 07:04 rmarx

Thanks for your interest in contributing to the HTTP chapter this year @paivaspol @tunetheweb @rmarx!

We still don't have an author/lead for this chapter, which puts us behind on Milestone 0 and at risk of having to shut it down. Please consider reaching out to your professional networks to see if anyone else would be interested to contribute.

Meanwhile, it's not too early to start making progress on Milestone 1, which is to complete the chapter outline and is due on May 15. Make sure you have access to the chapter planning doc and also the #web-almanac-http channel on Slack. Even without an author committed yet, you can still start adding your ideas to the doc for things that should be covered this year.

Thanks everyone!

rviscomi avatar May 02 '22 20:05 rviscomi

Hi @tunetheweb @rmarx! Excited to be working with both of you. This is my first time contributing so I'll have a lot to learn from both of you.

I added a rough outline in the doc--basically short phrases on the high level idea in that topic borrowing some ideas from previous years :) . Please take a look, make changes, and leave any comments. I think once we have our outline relatively set, we can add specifics of the data that we need to support each point.

@rviscomi I can be an author. You can remove me from a reviewer.

paivaspol avatar May 03 '22 00:05 paivaspol

Great thanks @paivaspol! πŸŽ‰

As the lead author, you'll be responsible for keeping this chapter on schedule and meeting the milestones at the top of this issue. You have edit access to that comment, so please keep it updated as you complete your milestones or if the contributors change. You'll also be responsible for coordinating with the rest of the team to make sure everyone is contributing as needed. Let us know if you have any questions/concerns. Thanks again!

rviscomi avatar May 03 '22 14:05 rviscomi

This is an interesting one we haven't explored in the HTTP Chapter before: https://twitter.com/HTTPArchive/status/1521667976969310208?s=20&t=Ac330HAr5spnTOfup3hC8w

We could look at TTFB for home pages from HTTP Archive crawl, and from origin-level (in CrUX dataset) and see how TTFB for our HTTP connections are impacting web performance?

Or maybe it's one for Performance chapter?

tunetheweb avatar May 04 '22 12:05 tunetheweb

IMO, it might be more appropriate in the Performance chapter as the HTTP protocol itself isn't the only impacting factor and RTT could be the larger impacting factor.

Tangentially, a challenge that I can think of is that TTFB from HTTP Archive crawl may not line up with TTFB from CrUX? I've never looked at it myself, but it wouldn't be surprised if they don't line up as one is from a lab setting and the other is RUM.

That said, if we compare the TTFB results from HTTP Archive and CrUX and they don't align, there might be a potential for a small section pointing out the importance of having both synthetic measurements and using RUM. What do you think?

paivaspol avatar May 05 '22 11:05 paivaspol

@rviscomi we currently do not have an editor, while I have some experience writing in the past, I'm sure my writing is far from perfect. :) I'm wondering what would be the process without an editor?

paivaspol avatar May 05 '22 11:05 paivaspol

WebPageTest (which HTTP Archive uses for the crawl) grabs the CrUX TTFB, originally to display this in the WebPageTest website for that test, but luckily for us it saves that CrUX TTFB in the result ready for us to query.

Agree the crawl TTFB is of limited use as from US data centre and lab-based. The CrUX TTFB (which, as per above, we also get now) is much more useful.

tunetheweb avatar May 05 '22 11:05 tunetheweb

@paivaspol I can act as editor and potentially also co-author for some specific parts if you need/want assistance :)

rmarx avatar May 05 '22 12:05 rmarx

Think you might be better as Reviewer (check technical accuracy and also help suggest and review content) rather than Editor (check for typos, formatting and consistency with rest of Web Almanac) @rmarx ?

tunetheweb avatar May 05 '22 13:05 tunetheweb

Let's call it a Reviewing Editor or Editing Reviewer then and split the difference? :) I will of course also still review the contents (if wanted), but in my experience I always do an editing pass automatically when reviewing content.

rmarx avatar May 05 '22 13:05 rmarx

@rmarx Thank you! You help is very welcomed especially for the HTTP/3 parts of the chapter.

paivaspol avatar May 06 '22 12:05 paivaspol

@paivaspol Holy Hell! I've only just now realized you're the first author of the Vroom paper! Big fan of your work :)

rmarx avatar May 06 '22 14:05 rmarx

Just wanted to say the outline is looking great, nice work team! πŸš€

rviscomi avatar May 06 '22 18:05 rviscomi

Added a section on the metrics needed for the section.

@rmarx @tunetheweb I left some comments specifically on the metrics. Can you please take a look and leave your thoughts there? I'm relocating towards the end of this month, so I plan to start on the metrics earlier so that we can hit the June 1 deadline for the metrics. Thanks!

paivaspol avatar May 09 '22 23:05 paivaspol

I'm happy to lend some reviewing or editing help once the chapter is formed up a bit more.

LPardue avatar May 11 '22 18:05 LPardue

@LPardue Thank you for helping! I added you as an editor in the top comment above.

paivaspol avatar May 12 '22 12:05 paivaspol

Similar to comment above, I'd suggest @LPardue 's expertise on the subject makes him a Reviewer rather than an Editor. Some light editing is expected of reviewers while reviewing obviously, but the Editor role is specifically about enforcing Spelling, grammar and Web Almanac standards to the chapter.

tunetheweb avatar May 12 '22 14:05 tunetheweb

I think we have a reasonable outline and metrics to support each point. I'll start working on the next item and draft a PR to track the progress on queries for the metrics needed for the chapter.

paivaspol avatar May 17 '22 12:05 paivaspol

@rmarx @tunetheweb @LPardue @rviscomi I completed most of the queries except for one on quantifying number of link rel="preload" and quantifying the HTTP versions announced via alt-svc headers. I'll finish those by this upcoming weekend or latest mid-next week.

For the ones that I completed, I created a section called Takeaways in the chapter doc. The link to the spreadsheet is provided at the beginning of the Takeaway chapter. Can you please take a look and let me know your thoughts? If there's an error to the query, please let me know as well. The queries are in the PR linked above.

Finally, I started putting together some text for the chapter draft. Right now, nothing is worth looking at. I'll keep you posted once I have a more reasonable draft to make a pass on.

paivaspol avatar Aug 03 '22 04:08 paivaspol

@rmarx @paulcalvano @rviscomi @rmarx I completed the first draft for the chapter. Please take a look and let me know what you think.

paivaspol avatar Aug 22 '22 20:08 paivaspol

I have just completed a full editing/reviewing pass on the first draft.

Most of this was replacing literal repetitions in the same sentence / subsequent sentences with acronyms or removing them (you have a tendency to repeat yourself a lot apparently, @paivaspol :P). Some minor sentences/parts were edited for correctness/clarity. I've done all this as suggestions to make it easier to track.

There are still some open comments that need to be addressed for the final version though, especially in the HTTP/3 part, though none seem to be much work.

Thanks for writing the text for this year @paivaspol. It's a nice, to-the-point chapter that discusses the high-level challenges and potential solutions, as well as the necessary data to back it up!

rmarx avatar Sep 01 '22 12:09 rmarx

Thanks for the work @paivaspol . I plan to review post and comments by ~4 days from now.

LPardue avatar Sep 01 '22 13:09 LPardue

Thanks everyone for the help! There are some comments left that I haven't got a chance to get to. I'll address them this weekend.

paivaspol avatar Sep 02 '22 03:09 paivaspol

I resolved most of the comments in the text. Can you please take a look?

paivaspol avatar Sep 05 '22 02:09 paivaspol

Just went through the latest changes. Looks good overall. Made a few small changes directly for clarification (e.g., the long list of alt-svc options).

Added a single new issue about how to mention CDNs that offer HTTP/3. Probably something to look at for @tunetheweb as well :)

rmarx avatar Sep 05 '22 12:09 rmarx

@paivaspol I just completed my review.

Firstly, a big thank you for penning everything here! It is a good and interesting read.

I've mainly pointed out minor things that are mostly editorial but verge a little on the technical side. I hope that these are straightforward to resolve quickly.

I noted a few other things that might be interesting to dig into. But that is probably me coming too late to the table - so I completely understand if they are closed with no action in order for us to publish the chapter.

LPardue avatar Sep 05 '22 22:09 LPardue

I think the chapter is in a good state. Thanks @rmarx @LPardue and @tunetheweb for all the valuable feedback!

Currently, there are two comments left in the chapter:

  1. On whether we want to see if sites with HTTP/2+ have better CWV metrics. @tunetheweb do you still want to run some queries on that? I'm okay with not having that measurements too.
  2. On some queries that is returning weird results for resources served from Google. I also mentioned this in more detail at https://github.com/HTTPArchive/almanac.httparchive.org/pull/3108/files.

@LPardue you mentioned that there might be some external reviewers who are interested. feel free to share the chapter with them and let me know if there are any comments / concerns.

I'll start converting the other sections of the chapter into Markdown. Finally, I'll be offline between 9/9 - 9/12 but will get try to get this done by 9/15.

paivaspol avatar Sep 08 '22 04:09 paivaspol