Skip to content
forked from w3c/webref

Machine-readable references and analyses of terms defined in web browser specifications

License

Notifications You must be signed in to change notification settings

linkeddata/webref

 
 

Repository files navigation

Webref

Description

This repository contains machine-readable references of CSS properties, definitions, IDL, and other useful terms that can be automatically extracted from web browser specifications. The contents of the repository are updated automatically every 6 hours (although note information about published /TR/ versions of specifications are updated only once per week).

Specifications covered by this repository are technical Web specifications that are directly implemented or that will be implemented by Web browsers; in other words, those that appear in browser-specs.

This repository contains raw and automatically-generated extracts from web browser specifications. These extracts come with no guarantee on validity or consistency. For instance, if a specification defines invalid IDL snippets or uses an unknown IDL type, the corresponding IDL extract in this repository will be invalid as well.

Curated subsets of the repository content are published as NPM packages, updated on a weekly basis when the underlying content has changed:

The NPM packages provide additional validity and consistency guarantees. Unless you are ready to deal with invalid content, we recommend that you rely on the content of these NPM packages instead of on the non-curated content in this repository.

Available extracts

This repository contains information about latest Editor's Drafts of Web specifications in the ed folder, as well as about the latest published version (for /TR/ specifications) in the tr folder. More often that not, published versions of specifications are much older than their latest Editor's Draft. Data in the tr folder is more invalid/inconsistent than data in the ed folder as a result.

The following subfolders contain individual machine-readable JSON or text files generated from specifications:

  • ed/css and tr/css: CSS terms (properties, descriptors, value spaces). One file per specification series.
  • ed/dfns and tr/dfns: <dfn> terms, along with metadata such as linking text, access level, namespace. One file per specification.
  • ed/elements and tr/elements: Markup elements defined, along with the interface that they implement. One file per specification.
  • ed/headings and tr/headings: Section headings. One file per specification.
  • ed/idl and tr/idl: Raw WebIDL index. One file per specification series.
  • ed/idlparsed and tr/idlparsed: Parsed WebIDL. One file per specification.
  • ed/links and tr/links: Links to other documents, along with targeted fragments. One file per specification.
  • ed/refs and tr/refs: Normative and informative references to other specifications. One file per specification.

Individual files are named after the shortname of the specification, or after the shortname of the specification series for CSS definitions and raw IDL files. Individual files are only created when needed, meaning when the specification actually includes relevant terms.

The ed/index.json and tr/index.json files contain the index of specifications that have been crawled, and relative links to individual files that have been created.

This repository uses Reffy, a Web spec exploration tool, to crawl the specifications and generate the data. In particular, the data it contains is the result of running Reffy. The repository does not contain any more data.

Raw WebIDL extracts are used in web-platform-tests, please see their interfaces/README.md for details.

Potential spec anomalies

On top of data extracted from the specifications, this repository also contains analyses of potential anomalies that specifications may have, such as missing references and invalid WebIDL definitions. Anomaly reports may contain false positives.

How to suggest changes or report an error

Feel free to raise issues in this repository as needed. Note that most issues likely more directly apply to underlying tools:

  • Errors in the data are most likely caused by bugs or missing features in Reffy, which is the tool that crawls and parses specifications under the hoods. If you spot an error, please report it in Reffy's issue tracker.
  • If you believe that a spec is missing from the list, please check browser-specs and report it there.

Development notes

GitHub Actions workflows are used to automate most of the tasks in this repo.

Data update

  • Update ED report - crawls the latest version of Editor's Drafts and updates the contents of the ed folder. Workflow runs every 6 hours. A typical crawl takes about 10mn to complete.
  • Update TR report - crawls the published version of Editor's Drafts and updates the contents of the tr folder. Workflow runs once per week on Monday. A typical crawl takes about 10mn to complete.
  • Test: tests the contents of the repo. Runs each time there is a push against the default branch.
  • Clean up abandoned files - Checks the contents of repository to detect orphan crawl files that are no longer targeted by the latest crawl's result and creates a PR to delete these files from the repository. Runs once per week on Wednesday. The crawl workflows does not delete these files automatically because crawl sometimes fails on a spec due to transient network or spec errors.

Releases to NPM

  • @webref/css: Prepare release PR if needed - Checks latest CSS extracts and create a pre-release PR if a new version of the @webref/css npm package should be released. Runs after each crawl and whenever a push is made to the default branch on CSS files (except when this push is on the packages folder to avoid re-entrance issues). The pre-release PR details the diff that would be released, and bumps the package version in packages/css/package.json.
  • @webref/elements: Prepare release PR if needed - Checks latest elements extracts and create a pre-release PR if a new version of the @webref/elements npm package should be released. Runs after each crawl and whenever a push is made to the default branch on elements files (except when this push is on the packages folder to avoid re-entrance issues). The pre-release PR details the diff that would be released, and bumps the package version in packages/elements/package.json.
  • @webref/idl: Prepare release PR if needed - Checks latest IDL extracts and create a pre-release PR if a new version of the @webref/idl npm package should be released. Runs after each crawl and whenever a push is made to the default branch on IDL files (except when this push is on the packages folder to avoid re-entrance issues). The pre-release PR details the diff that would be released, and bumps the package version in packages/idl/package.json.
  • @webref release: Request review of pre-release PR - Assigns reviewers to pre-release CSS/IDL PRs if they exist. Runs once per week on Thursday.
  • Publish @webref package if needed - Publishes a new version of the @webref/css, @webref/elements or @webref/idl package to npm and tags the corresponding commit on the default branch. Runs whenever a pre-release PR is merged. Note that the released version is the version that appeared in packages/css/package.json, packages/elements/package.json or packages/idl/package.json before the pre-release PR is merged.

About

Machine-readable references and analyses of terms defined in web browser specifications

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 99.4%
  • Other 0.6%