Make machines work for you

Last updated on 2026-04-14 | Edit this page

Estimated time: 25 minutes

Overview

Questions

  • What is the difference between metadata and rich metadata?
  • How can a rich metadata file be created?
  • Where should rich metadata be stored?

Objectives

  • Understand the difference between plain metadata and rich metadata.
  • Learn where machine-readable metadata comes from and how to generate it.
  • Identify good places to publish rich metadata files.
Callout

FAIR principles used for rich metadata

Findable:

Interoperable:

What is the difference between metadata and rich metadata?


Metadata is data about the data. It describes properties of a digital object, such as title, creator, publisher, size, or identifier.

Example of metadata fields shown in a repository
Repository metadata example
Metadata attribute Example
Descriptive metadata DOI
Structural metadata Data size
Administrative metadata Publisher
Statistical metadata Number of files

Rich metadata goes further. It is:

  • standardized
  • structured
  • machine-readable
  • based on shared vocabularies
  • suitable for search engines and automated reuse
Callout

Rich metadata is more than plain text

Metadata alone can be descriptive but still hard for machines to interpret. Rich metadata uses a structured format such as JSON-LD and shared schemas such as Schema.org, DataCite, or Dublin Core.

Further reading:

Additional walkthrough:

How can a rich metadata file be created?


Researchers usually do not need to write rich metadata by hand. In many cases, it can be exported from a repository or generated through a form-based tool.

Platform Source Online Note
Dataverse export button https://dataverse.nl/dataset.xhtml?persistentId=doi:10.34894/Q80QUE Yes Fastest path for datasets
FAIR Metadata Wizard https://maastrichtu-ids.github.io/fair-metadata-wizard/ Yes Tailored to scientific projects
NSDRA JSON-LD generator https://nsdra.github.io/nsdra-jsonld-metadata-generator-webapp/# Yes Community-specific but adaptable
Steal Our JSON-LD https://jsonld.com/json-ld-generator/ Yes General-purpose
JSON-LD Schema Generator for SEO https://hallanalysis.com/json-ld-generator/ Yes Broad but SEO-oriented
Repository export interface showing JSON-LD options
Exporting JSON-LD metadata from a repository

Where should a rich metadata file be stored?


A simple rule is:

Callout

Publish rich metadata everywhere the data lives

  • in the project root folder
  • in the data repository
  • in the GitHub repository
  • on the project website

The exact schema may differ by context, but common choices include:

Rich metadata also helps connect publications, datasets, and project websites. Without structured metadata, search engines and aggregators may not recognize the resource as a dataset at all.

Discussion

Scenario

You are creating a project website for a research consortium.

How should rich metadata relate to that website? Which information belongs in the HTML, which belongs in repository records, and how do those pieces work together to improve discovery?

Key Points
  • Rich metadata combines descriptive metadata with shared vocabularies and a structured machine-readable format.
  • JSON-LD is a common way to publish rich metadata.
  • Repositories can often generate rich metadata automatically.
  • Rich metadata should be published anywhere the digital object is stored or represented.