css-parsing-tests icon indicating copy to clipboard operation
css-parsing-tests copied to clipboard

Implementation-independant, JSON-based tests for CSS parsers

CSS parsing tests #################

This repository contains implementation-independent test for CSS parsers, based on the 2013 draft of the CSS Syntax Level 3_ specification.

.. _CSS Syntax Level 3: https://drafts.csswg.org/css-syntax-3/

The upstream repository for these tests is at https://github.com/SimonSapin/css-parsing-tests

Projects using this

CSS parsers using these tests:

  • tinycss2 <https://github.com/SimonSapin/tinycss2>_ (Python)
  • rust-cssparser <https://github.com/mozilla-servo/rust-cssparser>_ (Rust, used in Servo <https://github.com/mozilla/servo/>_)
  • Crass <https://github.com/rgrove/crass/>_ (Ruby)

Importing

The recommended way to use these tests in an implementation is to import them with git-subtree_.

.. _git-subtree: https://github.com/git/git/tree/master/contrib/subtree

To import the first time to a ./css-parsing-tests sub-directory, run this from the top-level of a git repository::

git subtree add -P css-parsing-tests https://github.com/SimonSapin/css-parsing-tests.git master

Later, to merge changes made in the upstream repository, run::

git subtree pull -P css-parsing-tests https://github.com/SimonSapin/css-parsing-tests.git master

Test files

CSS Syntax specification describes a number of "functions". Each .json file in this repository corresponds to such a function. The files are encoded as UTF-8 and each contain a JSON array with an even number of items, where each pair of items is one function input associated with the expected result.

component_value_list.json Tests Parse a list of component values <https://drafts.csswg.org/css-syntax-3/#parse-a-list-of-component-values>. The Unicode input is represented by a JSON string, the output as an array of component values as described below.

component_value_list.json Tests Parse a component value <https://drafts.csswg.org/css-syntax-3/#parse-a-component-value>. The Unicode input is represented by a JSON string, the output as a component value.

declaration_list.json Tests Parse a list of declarations <https://drafts.csswg.org/css-syntax-3/#parse-a-list-of-declarations>. The Unicode input is represented by a JSON string, the output as an array of declarations and at-rules_.

blocks_contents.json Tests Parse a block’s contents <http://dev.w3.org/csswg/css-syntax-3/#parse-block-contents>. The Unicode input is represented by a JSON string, the output as an array of declarations, at-rules_ and qualified rules_.

one_declaration.json Tests Parse a declaration <https://drafts.csswg.org/css-syntax-3/#parse-a-declaration>. The Unicode input is represented by a JSON string, the output as a declaration.

one_rule.json Tests Parse a rule <https://drafts.csswg.org/css-syntax-3/#parse-a-rule>. The Unicode input is represented by a JSON string, the output as a qualified rule or at-rule_.

rule_list.json Tests Parse a list of rules <https://drafts.csswg.org/css-syntax-3/#parse-a-list-of-rules>. The Unicode input is represented by a JSON string, the output as a list of qualified rules or at-rules_.

stylesheet.json Tests Parse a stylesheet <https://drafts.csswg.org/css-syntax-3/#parse-a-stylesheet>. The Unicode input is represented by a JSON string, the output as a list of qualified rules or at-rules_.

stylesheet_bytes.json Tests Parse a stylesheet <https://drafts.csswg.org/css-syntax-3/#parse-a-stylesheet>_ together with The input byte stream <https://drafts.csswg.org/css-syntax-3/#input-byte-stream>_. The input is represented as a JSON object containing:

* A required ``css_bytes``, the input byte string,
  represented as a JSON string where code points U+0000 to U+00FF
  represent bytes of the same value.
* An optional ``protocol_encoding``,
  a protocol encoding label as a JSON string, or null.
* An optional ``environment_encoding``,
  an environment encoding label as a JSON string, or null.
* An optional ``comment`` that is ignored.

The output is represented a list of `qualified rules`_ or at-rules_.

color3.json Tests the <color> syntax defined in CSS Color Level 3 <http://www.w3.org/TR/css3-color/#colorunits>_. The Unicode input is represented by a JSON string, the output as one of:

* null if the input is not a valid color in CSS syntax
* The string "currentcolor" for the currentcolor keyword
* An array of length 4 for every other values:
  four (floating point) numbers for the Red, Green, Blue and Alpha channel.
  Each value is between 0 and 1.

color3_hsl.json Same as color3.json. This file is generated by the make_color3_hsl.py Python script.

color3_keywords.json Same as color3.json, except that the values for the Red, Green and Blue channel are between 0 and 255. This file is generated by the make_color3_keywords.py Python script.

An+B.json Tests the An+B <https://drafts.csswg.org/css-syntax-3/#the-anb-type>_ syntax defined in CSS Syntax Level 3. This differs <https://drafts.csswg.org/css-syntax/#changes>_ from the nth grammar rule <http://www.w3.org/TR/css3-selectors/#nth-child-pseudo>_ in Selectors Level 3 only in that - characters and digits can be escaped in some cases. The Unicode input is represented by a JSON string, the output as null for invalid syntax, or an array of two integers [A, B].

Result representation

AST nodes (the results of parsing) are represented in JSON as follow. This representation was chosen to be compact (and thus less annoying to write by hand) while staying unambiguous. For example, the difference between @import and \@import is not lost: they are represented as ["at-keyword", "import"] and ["ident", "@import"], respectively.

Rules and declarations

.. _at-rule: .. _at-rules: .. _qualified rule: .. _qualified rules: .. _declaration: .. _declarations:

At-rule An array of length 4: the string "at-rule", the name (value of the at-keyword) as a string, the prelude as a nested array of component values_, and the optional block as a nested array of component value, or null.

Qualified rule An array of length 3: the string "qualified rule", the prelude as a nested array of component values_, and the block as a nested array of component value.

Declaration An array of length 4: the string "declaration", the name as a string, the value as a nested array of component values_, and a the important flag as a boolean.

.. _component value: .. _component values:

Component values

Array of length 2: the string ``"ident"``, and the value as a string. Array of length 2: the string ``"at-keyword"``, and the value as a string. Array of length 3: the string ``"hash"``, the value as a string, and the type as the string ``"id"`` or ``"unrestricted"``. Array of length 2: the string ``"string"``, and the value as a string. Array of length 1: the string ``"bad-string"``. Array of length 2: the string ``"url"``, and the value as a string. Array of length 1: the string ``"bad-url"``. The value as a one-character string. Array of length 4: the string ``"number"``, the representation as a string, the value as a number, and the type as the string ``"integer"`` or ``"number"``. Array of length 4: the string ``"percentage"``, the representation as a string, the value as a number, and the type as the string ``"integer"`` or ``"number"``. Array of length 4: the string ``"dimension"``, the representation as a string, the value as a number, the type as the string ``"integer"`` or ``"number"``, and the unit as a string. Array of length 3: the string ``"unicode-range"``, followed by the *start* and *end* integers as two numbers. The string ``"~="``. The string ``"|="``. The string ``"^="``. The string ``"$="``. The string ``"*="``. The string ``"||"``. The string ``" "`` (a single space.) The string ``""``. The string ``":"``. The string ``";"``. The string ``","``.

{} block An array of length N+1: the string "{}" followed by the N component values_ of the block’s content.

[] block An array of length N+1: the string "[]" followed by the N component values_ of the block’s content.

() block An array of length N+1: the string "()" followed by the N component values_ of the block’s content.

Function An array of length N+2: the string "function" and the name of the function as a string followed by the N component values_ of the function’s arguments.

The array of two strings ``["error", "bad-string"]``. The array of two strings ``["error", "bad-url"]``.

Unmatched <}> The array of two strings ["error", "}"].

Unmatched <]> The array of two strings ["error", "]"].

Unmatched <)> The array of two strings ["error", ")"].