diff --git a/src/content/en/2020/markup.md b/src/content/en/2020/markup.md
index 1a7388acfb9..d423f039d57 100644
--- a/src/content/en/2020/markup.md
+++ b/src/content/en/2020/markup.md
@@ -20,7 +20,6 @@ featured_stat_2: 30,073
featured_stat_label_2: Number of non-standard h7
elements
featured_stat_3: 25.24 KB
featured_stat_label_3: Weight of the median document
-unedited: true
---
## Introduction
@@ -76,17 +75,15 @@ A page's document size refers to the amount of HTML bytes transferred over the n
{# TODO(authors, analysts): Revisit the "largest document" stat and interpretation. #}
* 1,110 documents are empty (0 bytes).
-* The average document size is 50.35 KB ([in most cases compressed](https://w3techs.com/technologies/details/ce-gzipcompression)).
-* The largest document by far weighs 64.16 _MB_, almost deserving its own analysis and chapter in the Web Almanac.
+* The average document size is 49.17 KB ([in most cases compressed](https://w3techs.com/technologies/details/ce-gzipcompression)).
+* The largest document by far weighs 61.19 _MB_, almost deserving its own analysis and chapter in the Web Almanac.
-{# TODO(analysts): Should 25,237 bytes be divided by 1000 or 1024 to convert to KB? 1000 seems to be used here but most chapters use 1024. Are the stats above also off? #}
-{# TODO(authors): What's the implication and your interpretation of this value? Is this a surprisingly big number? Or does it align with your expectations? #}
-How is this situation in general, then? The median document weighs 25.24 KB:
+How is this situation in general, then? The median document weighs 24.65 KB, which comes [without surprises](https://httparchive.org/reports/page-weight):
{{ figure_markup(
image="document-size.png",
caption="The amount of HTML bytes transferred over the network, including compression if enabled.",
- description="Document size in bytes per percentile, with the median document weighing 26.62 KB on desktop.",
+ description="Document size in bytes per percentile, with the median document weighing 25.99 KB on desktop.",
sheets_gid="2066175354",
chart_url="https://docs.google.com/spreadsheets/d/e/2PACX-1vQPKzFb574UnGTcfw5mcD1qR7RYHyGjQTc2hiMuYix0QoTH1DPe54Q2JucXL8bfZ6kjRoAfhk3ckudc/pubchart?oid=386686971&format=interactive",
width=600,
@@ -102,26 +99,20 @@ We identified 2,863 different values for the `lang` attribute on the `html` star
22.36% of all documents specify no `lang` attribute. The commonly accepted view is that [they should](https://www.w3.org/TR/i18n-html-tech-lang/#overall), but beside the idea that software could eventually [detect language automatically](https://meiert.com/en/blog/lang/), document language can also be specified [on the protocol level](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Content-Language). This is something we didn't check.
-Here are the 10 most popular (normalized) languages in our sample. At first we contemplated merging same-language values like `en-US` and `en-GB`, but the difference was pronounced, so we decided not to do so.
+Here are the 10 most popular (normalized) languages in our sample. It's important to note that the HTTP Archive crawls from US data centers with English language settings, so looking at the language pages are written in will be skewed towards English. Nevertheless we present the `lang` attributes seen to give some context to the sites analyzed.
-
-
-{# TODO(authors): Add an interpretation of the lang results. #}
+{{ figure_markup(
+ image="document-language.png",
+ alt="The top HTML lang attributes.",
+ caption="The top HTML `lang` attributes.",
+ description="Bar chart showing the top 10 `lang` attributes used in our crawl with 22.82% of desktop and 22.36% of mobile pages not setting this, `en` being used on 20.09% and 18.08% respectively, `ja` on 15.17% and 13.27%, `es` on 4.86% and 4.09% , `pt-br` on 2.65% and 2.84%, `ru` on 2.21% 2.53%, `en-gb` on 2.35% and 2.19%, `de` on 1.50% and 1.92%, and finally `fr` being used on 1.55% and 1.43% respectively.",
+ sheets_gid="2047285366",
+ chart_url="https://docs.google.com/spreadsheets/d/e/2PACX-1vQPKzFb574UnGTcfw5mcD1qR7RYHyGjQTc2hiMuYix0QoTH1DPe54Q2JucXL8bfZ6kjRoAfhk3ckudc/pubchart?oid=1873310240&format=interactive",
+ width=600,
+ height=371,
+ sql_file="pages_almanac_by_device_and_html_lang.sql"
+ )
+}}
### Comments
@@ -131,9 +122,7 @@ Adding comments to code is generally a good practice and HTML comments are there
```
-Although many pages will have been stripped of comments for production, we found that index pages in the 90th percentile are using about 73 comments on mobile, respectively 79 comments on desktop, while in the 10th percentile the number of the comments is about 2.
-
-{# TODO(authors): How about the median number for a typical website? #}
+Although many pages will have been stripped of comments for production, we found that index pages in the 90th percentile are using about 73 comments on mobile, respectively 79 comments on desktop, while in the 10th percentile the number of the comments is about 2. The median page uses 16 (mobile) or 17 comments (desktop).
Around 89% of pages contain at least one HTML comment, while about 46% of them contain a conditional comment.
@@ -151,7 +140,7 @@ Still, on the above percentile extremes, we found that web pages are using about
For production, HTML comments are usually stripped by build tools. Considering all the above counts and percentages, and referring to the use of comments in general, we suppose that lots of pages are served without involving an HTML minifier.
-### Script use
+### Script use
As shown in the [Top elements](#top-elements) section below, the `script` element is the 6th most frequently used HTML element. For the purposes of this chapter, we were interested in the ways the `script` element is used across these millions of pages from the data set.
@@ -172,7 +161,7 @@ At the opposite end of the spectrum, the numbers show that about 97% of pages co
)
}}
-When scripting is unsupported or turned off in the browser, the `noscript` element helps to add an HTML section within a page. Considering the above script numbers, we were curious about the `noscript` element as well.
+When scripting is unsupported or turned off in the browser, the `noscript` element helps to add an HTML section within a page. Considering the above script numbers, we were curious about the `noscript` element as well.
Following the analysis, we found that about 49% of pages are using a `noscript` element. At the same time, about 16% of `noscript` elements were containing an `iframe` with a `src` value referring to "googletagmanager.com".
@@ -182,13 +171,11 @@ This seems to confirm the theory that the total number of `noscript` elements in
What `type` attribute values are used with `script` elements?
-{# TODO(authors, analysts): Should this be a figure? #}
-{# TODO(authors): Explain the significance of the "!" in text. #}
- `text/javascript`: 60.03%
- `application/ld+json`: 1.68%
- `application/json`: 0.41%
- `text/template`: 0.41%
-- `text/html` (!) 0.27%
+- `text/html` 0.27%
When it comes to loading [JavaScript module scripts](https://jakearchibald.com/2017/es-modules-in-browsers/) using `type="module"`, we found that 0.13% of `script` elements currently specify this attribute-value combination. `nomodule` is used by 0.95% of all tested pages. (Note that one metric relates to elements, the other to pages.)
@@ -351,13 +338,13 @@ Standard elements are those that are or were part of the HTML specification. Whi
Within the latest Safari and Firefox versions, setting target="_blank"
on a
elements implicitly provides the same rel
behavior as setting rel="noopener"
. This is already implemented in Chromium as well and will land in Chrome 88.