schema.org — Structured Data for the Whole Web

The pragmatic vocabulary that won the web by choosing simplicity over rigour.

0/2 done

Theory

Pragmatism at planet scale

schema.org is the vocabulary behind structured data on the web. Launched by Google, Microsoft (Bing), Yahoo and Yandex, it lets page authors mark up what a page is about — a product, an event, a recipe, an organisation — so search engines produce rich results, knowledge panels and better answers. It is open and, deliberately, lightweight.

  • Deliberately informal — schema.org is closer to RDFS than strict OWL DL. domainIncludes / rangeIncludes express soft expectations, not hard logical restrictions; multi-typing is normal. This looseness is a feature: millions of non-experts can use it correctly.
  • Three serialisationsJSON-LD (the recommended, script-tag form), Microdata, and RDFa. JSON-LD won because it doesn't entangle markup with visible HTML.
  • Extensible — hosted and external extensions add depth: GS1 (products), bioschemas (life science), automotive, and more — so the simple core can be specialised per industry.
  • Why it matters commercially — structured data drives SEO and visibility; it's the most economically deployed vocabulary on this list, on hundreds of millions of pages.

Use Case Example: A shop adds a JSON-LD Product block with price, availability and reviews. Google renders a rich result with stars and price directly in search — measurably lifting click-through — purely because the page spoke a shared, simple vocabulary the crawler already understood.

Analogy

schema.org is the pidgin / lingua franca of the web. A pidgin wins not by being grammatically perfect but by being simple enough that everyone speaks it. A logically rigorous ontology is like demanding flawless Latin from every market trader — correct and useless. schema.org asked for just enough shared grammar that millions of webmasters could mark up a page right, and that modesty is exactly why it conquered the web.

Markup → crawler → rich result

Click a node to focus its neighbourhood · drag to pan · scroll to zoom

How structured data flows to a rich result

A page embeds JSON-LD using schema.org; the crawler turns shared terms into enhanced search features.

Field guide — schema.org, SKOS & SSSOM

Web and mapping field guide

schema.org and crosswalk work reward restraint. You are not always trying to prove strict logical identity.

TaskBest toolWhy
Mark up a public product/event/article pageschema.org JSON-LDcrawlers need simple, tolerant structured data
Publish a product identifier on the webGS1 Digital Link + schema.org/GS1 vocone URL can resolve to consumer, logistics and compliance views
Align an internal class under a standardrdfs:subClassOfhonest anchoring without claiming identity
Declare two classes exactly the sameowl:equivalentClassonly when a reasoner may safely merge their meaning
Say two terms are close but not identicalskos:closeMatchprevents false inference across granularity gaps
Govern a mapping tableSSSOM-style metadatacaptures author, predicate, confidence, evidence and version

The practical warning: the most damaging mapping bug is a false exact match. Prefer a weaker, honest relation over a strong, convenient lie. Your future reasoner, report and regulator will thank you.

Reflect

schema.org is the counterweight to the rest of this track: where FIBO and ISO 15926 buy power with rigour, schema.org bought ubiquity with restraint. Both bets are valid — the lesson is to match formality to the audience, not to maximise it.

  • When is 'soft expectations, not hard axioms' the right call for a vocabulary you design?
  • Where might schema.org's looseness actually hurt — and you'd want OWL/SHACL instead?

Reading in progress · 0 of 2 activities done