almanac.httparchive.org icon indicating copy to clipboard operation
almanac.httparchive.org copied to clipboard

CMS 2022

Open rviscomi opened this issue 2 years ago • 21 comments

CMS 2022

CMS illustration

If you're interested in contributing to the CMS chapter of the 2022 Web Almanac, please reply to this issue and indicate which role or roles best fit your interest and availability: author, reviewer, analyst, and/or editor.

Content team

Lead Authors Reviewers Analysts Editors Coordinator
@sirjonathan @sirjonathan @alexdenning @dknauss @alonkochba @honzasladek @csliva - @tunetheweb
Expand for more information about each role 👀
  • The content team lead is the chapter owner and responsible for setting the scope of the chapter and managing contributors' day-to-day progress.
  • Authors are subject matter experts and lead the content direction for each chapter. Chapters typically have one or two authors. Authors are responsible for planning the outline of the chapter, analyzing stats and trends, and writing the annual report.
  • Reviewers are also subject matter experts and assist authors with technical reviews during the planning, analyzing, and writing phases.
  • Analysts are responsible for researching the stats and trends used throughout the Almanac. Analysts work closely with authors and reviewers during the planning phase to give direction on the types of stats that are possible from the dataset, and during the analyzing/writing phases to ensure that the stats are used correctly.
  • Editors are technical writers who have a penchant for both technical and non-technical content correctness. Editors have a mastery of the English language and work closely with authors to help wordsmith content and ensure that everything fits together as a cohesive unit.
  • The section coordinator is the overall owner for all chapters within a section like "User Experience" or "Page Content" and helps to keep each chapter on schedule.

Note: The time commitment for each role varies by the chapter's scope and complexity as well as the number of contributors.

For an overview of how the roles work together at each phase of the project, see the Chapter Lifecycle doc.

Milestone checklist

0. Form the content team

  • [x] May 1: The content team has at least one author, reviewer, and analyst

1. Plan content

  • [x] May 15 The content team has completed the chapter outline in the draft doc

2. Gather data

  • [x] June 1: Analysts have added all necessary custom metrics and drafted a PR (example) to track query progress
  • June 1 - 15: HTTP Archive runs the June crawl

3. Validate results

  • [x] August 1: Analysts have queried all metrics and saved the output to the results sheet

4. Draft content

  • [ ] September 1: The content team has written, reviewed, and edited the chapter in the doc

5. Publication

  • [ ] September 15: The completed chapter and all required metadata and figures are converted to markdown and submitted to GitHub
  • September 26: Target launch date 🚀

Chapter resources

Refer to these 2022 CMS resources throughout the content creation process:

📄 Google Docs for outlining and drafting content 🔍 SQL files for committing the queries used during analysis 📊 Google Sheets for saving the results of queries 📝 Markdown file for publishing content and managing public metadata 💬 #web-almanac-cms on Slack for team coordination

rviscomi avatar Apr 12 '22 17:04 rviscomi

Calling the 2021 CMS super team!: @alonkochba @alankent @andreylipattsev @logicalphase @saterchris @rviscomi @TosinArasi

Are any of you interested in contributing again this year?

Also can any of you suggest potential authors/reviewers/analysts who have domain expertise in the CMS space that might make good contributors? Would love if you could reach out and/or tag suggestions here to see if we can get new people involved!

tunetheweb avatar Apr 15 '22 19:04 tunetheweb

I'm an analyst under the SEO section, but we have plenty of help over there. I can help with CMS as an analyst here as well.

I've used and built sites for wordpress, joomla, hubspot, drupal, shopify, ghost, contentful, magento, django, and Adobe experience manager so I should have the domain experience here.

csliva avatar Apr 21 '22 21:04 csliva

Excellent, would love to have you on board @csliva !

tunetheweb avatar Apr 21 '22 21:04 tunetheweb

Hi @tunetheweb I'm happy to help with this.

I can help with some content about headless CMS space. (Note: I'm currently doing https://github.com/contember which is a headless CMS so would need good reviewers to not introduce some bias into this chapter.)

Otherwise I've got 16 years of experience building websites. Including CMSes like Wordpress, Drupal, Joomla, Strapi, Directus, Sylius (not so much a cms), Prismic, Netlify CMS, ....

honzasladek avatar Apr 26 '22 08:04 honzasladek

Thanks @honzasladek , presume you're looking to author / co-author the chapter then?

tunetheweb avatar Apr 26 '22 17:04 tunetheweb

@tunetheweb I'm happy to serve as lead author. I spoke with @alexdenning and he's happy to be a reviewer. @honzasladek, if you're up for it, I'd love to work with you as a reviewer - your experience in a wide range of CMS and the fact that you're focused now on Contember would bring some great perspective into the mix.

sirjonathan avatar Apr 29 '22 17:04 sirjonathan

Great to see the CMS chapter having an author. I can also assist in reviewing and anything you want to know about last year's chapter or data.

alonkochba avatar Apr 29 '22 18:04 alonkochba

@sirjonathan Sounds great, happy to participate as a reviewer!

honzasladek avatar May 01 '22 09:05 honzasladek

Thank you for agreeing to be the lead @sirjonathan !!

One of the first thing's I'll hand over to you @sirjonathan is maintenance of the first comment at the top of this issue. If you could ensure that's kept up to date with contributors that join and leave, be aware of the milestones and dates, and tick off each of these as you complete them. We have an overall Web Almanac issue tracker, that's automatically driven based on the info in that comment. And since we now have a full team, you can no tick off Milestone 0!

And @sirjonathan @alexdenning @alonkochba @honzasladek and @csliva if you could all request access to the CMS doc and start brainstorming ideas of what you want to include in the chapter.

We also have the HTTP Archive CMS slack channel if you could all join that?

It would be good to review previous years chapters and data to get a sense of what we're looking to create here:

Year Chapter Results Sheet
2021 2021 CMS Chapter 2021 Results Sheet
2020 2020 CMS Chapter 2020 Results Sheet
2019 2019 CMS Chapter 2019 Results Sheet

Some tips and advice from me, if I may be so bold:

  • Review all of last year's CMS queries and data and decide which you will likely want to use again this year. @csliva can then work on updating them to 2022.
  • Use them to consider new ideas for this year to differentiate the chapter, rather than it being just the exact same as previous years with refreshed data.
  • We are planning on using the 1st June data for the 2022 Web Almanac so that gives us exactly 1 month to get any new custom metrics in there so don't have huge amount of time. Custom metrics are bits of JavaScript we run on the rendered DOM to gather more information than the raw HTTP response bodies gives it - which is MUCH more accurate, easier (and cheaper!) to query than running regexs over raw HTML responses that don't even include any JavaScript created elements. Look at our current list of raw metrics that we use in our run to get some idea.
  • We also use Wappalyzer to detect technologies - including CMS. Here's the list of CMSes it detects: https://www.wappalyzer.com/technologies/cms. Are there any major ones missing? Can they be detected (note some CMSes cannot be detected by it?). If so, can we open an issue (or better yet a PR!) to Wappalyzer before 1st June to detect that so we'll have it for our analysis?

Many chapters find it helpful to have a kick off call with the team - to put faces to names, and introduce yourselves. I'\m happy to join that if you want to set it up @sirjonathan ?

I'll be keeping a close eye on the chapter and feel free to reach out to me here/Slack/Twitter/Email if you have any questions or thoughts.

tunetheweb avatar May 01 '22 14:05 tunetheweb

If I am not too late, I am happy to contribute as Author or Analyst. My experience and bio can be found here: https://byaman.com/about/

amandeepsinghvirdi avatar May 01 '22 20:05 amandeepsinghvirdi

Thanks @amandeepsinghvirdi! I'll defer to @sirjonathan as the content team lead to fit you in as needed. Meanwhile, request access to the chapter planning doc and add any ideas to the outline.

rviscomi avatar May 03 '22 14:05 rviscomi

@amandeepsinghvirdi has since agreed to take on Ecommerce lead so not sure if the offer is still there as might be busy with that one.

tunetheweb avatar May 03 '22 14:05 tunetheweb

@sirjonathan @alexdenning @alonkochba @honzasladek @csliva reminder to access the planning doc and start adding your ideas to the outline. What's new with CMS this year, or what topics would be interesting to revisit from previous years?

We're hoping to have the outline completed by May 15 to give us enough time to add any needed custom metrics to the June crawl before it kicks off. Thanks!

Would it also be helpful to have a video chat to introduce yourselves to each other? Here's a suggested agenda some of the other chapters are using for that: https://docs.google.com/document/d/1nqPjvY0zg2YJda3ghIJuvVaSeY_wbz_nwBK15tHULsg/edit?usp=sharing

I'm happy to attend anything you all set up. I'm UK timezone, but reasonably flexible.

tunetheweb avatar May 06 '22 16:05 tunetheweb

@sirjonathan @alexdenning @alonkochba @honzasladek @csliva we still don't have a draft outline so have missed our 15th May deadline. This means the chances of adding any NEW data to the 1st June crawl is getting slimmer by the day (since we don't yet know what we want to capture) so you may have to just rely on the existing data that HTTP Archive already collects.

But , it's still not too late. So I would suggest:

  1. Copying last years outline into the doc as a basic first step and then iterating on that.
  2. Reviewing the results from last year, and the queries from last year to see if that encourages any thoughts on new data you might want that wasn't considered last year.
  3. Review the CMS technologies captured by Wappalyzer to see if anything major is missing that we might want to see if we can get added

Happy to answer any questions on any of this!

tunetheweb avatar May 17 '22 12:05 tunetheweb

@tunetheweb Thank you for the reminders and the support! I've been gathering notes / ideas over the past week and reviewing the previous years results. I will make sure the outline is drafted and ready for review this week.

Also, @dknauss who serves as Editor over at Post Status, is up for being a reviewer.

sirjonathan avatar May 18 '22 06:05 sirjonathan

Sent you an invite to the GitHub org @dknauss so you can edit issues and the like.

tunetheweb avatar May 19 '22 12:05 tunetheweb

@tunetheweb I've updated the draft doc with an outline, using last year as a starting point.

I've added a few questions as comments within the doc that I'd love feedback on. (FYI, @alexdenning @alonkochba @honzasladek @csliva @dknauss - take a look when you're able and add your questions / comments directly to the doc)

No changes in queries from last year stand out to me. One thing I am curious about is if we can include some comparisons to Shopify or, perhaps more efficiently, we could do some collaboration with the Ecommerce chapter team.

sirjonathan avatar May 20 '22 23:05 sirjonathan

Hey all, apologies I've been a bit quiet lately, but been travelling and not had too much Almanac time lately.

How are we looking now? Milestone 2's date has passed by quite a bit now, but the crawl is still going on (not sure if you saw Rick's update on Slack in the Web Almanac channel?), so that gives us a bit of time to catch up.

We didn't need to add any new Custom Metrics for this chapter, so we just need to open a draft PR with a list of queries to write to complete that Milestone (note you do not need to have the queries written to complete Milestone 2 - just have a list of them). You can see example open PRs for the other chapters already open.

As it looks like you're just currently planning to reuse last year's queries that should be an easy task! @csilva as the analyst for this chapter can you do this? You can also copy all the queries from sql/2021/cms to sql/2022/cms directory and update the dates from 202110701 to 202206021 to really get ahead of the game and be ready to run them when the crawl finishes.

No changes in queries from last year stand out to me. One thing I am curious about is if we can include shttps://github.com/wappalyzer/wappalyzer/issues/6256#issuecomment-1133451774 or, perhaps more efficiently, we could do some collaboration with https://github.com/HTTPArchive/almanac.httparchive.org/issues/2897.

That definitely should be possible. Should just be a matter of replacing:

WHERE  category = 'CMS'

with:

WHERE  category = 'CMS' or App = 'Shopify'

Just need to decide if you want to run all queries with this, or only some of them? Or perhaps you want to run some or all queries twice (once with this, once without) to allow comparison to pervious years for some metrics?

tunetheweb avatar Jun 10 '22 11:06 tunetheweb

Thanks for the directions @tunetheweb. I'll go ahead and get that PR drafted. I will also see what we can do about platforms like Shopify. Shopify includes a blog and it is very much in the game for Content Management.

csliva avatar Jun 11 '22 13:06 csliva

Friendly ping on that draft PR @csliva . Most of the chapters have their draft open now.

tunetheweb avatar Jun 17 '22 13:06 tunetheweb

Hey @sirjonathan and @csliva, just to let you know that the data is all ready to start querying, so you can start writing (and even running) your queries. Ones that are straight copies of last year you can probably just run (with the updated dates of course), but if you do come up with any new ones, then might wanna ping the other analysts (including myself!) to have a quick review before running the full query (unless it's a very small, quick table in which case go right ahead).

Do reach out if you've any questions or need any help. Looking forward to seeing what insights you find!

tunetheweb avatar Jun 24 '22 18:06 tunetheweb

Hey y'all. The draft is ready for review and per discussion with @csliva, we're slated to have all feedback incorporated and the text complete by Friday, September 9, in order to allow sufficient time for markdown generation for the 15th deadline.

Overall, I've kept the structure from last year, simplifying in a few places and adding additional data where we have it (in particular, the Page Builders section).

Of the current draft, two items remain from my perspective:

  1. WordPress in 2022 - I'd like to incorporate any relevant highlights from 2021 on performance improvements in core development. @dknauss are there any items that come to mind for you, beyond the work of the Core Performance team?

  2. Conclusion - I've left the current conclusion in draft mostly unchanged from @alonkochba's work last year. After incorporating any further feedback, I'll take a fresh pass at it.

@alexdenning @alonkochba @dknauss @honzasladek Would you review the draft over the next few days and add in any comments or suggestions you have? I'll monitor your feedback throughout the week and then submit my final pass on Friday.

Thank y'all for your help!

sirjonathan avatar Sep 06 '22 04:09 sirjonathan