Set up your own terms

Last updated on 2026-04-14

Estimated time: 25 minutes

Overview

Questions

  • What are data terms of use?
  • What should a data terms of use statement contain?
  • What format should terms of use take?
  • What standard licenses are available for data?

Objectives

  • Understand what data terms of use are and why they matter.
  • Identify the minimum components of a basic terms of use statement.
  • Recognize when a standard license is sufficient and when a custom agreement is needed.

Callout

FAIR principles used in data terms of use

Accessible:

Reusable:

What are data terms of use?


Data terms of use are a textual statement that sets out the rules, conditions, licenses, and legal considerations that govern reuse of a data source.

Examples:

Figure: Terms of use example from the World Bank

Figure: Terms of use example from Numbeo

These examples show that terms of use usually describe the resource, the conditions under which it may be reused, and any expectations around attribution or restrictions.

What must a terms of use statement contain?


As a minimum, a data terms of use statement should cover the following elements:

| Section     | Description                                                      | Example                                            |
|-------------|------------------------------------------------------------------|----------------------------------------------------|
| Description | What the statement refers to and which digital objects it covers | “These terms apply to the Happy Dataset.”          |
| License     | Under which conditions reuse is allowed                          | “The Happy Dataset is in the public domain.”       |
| Attribution | How the data should be cited or acknowledged                     | “Please cite the Happy Dataset.”                   |
| Disclaimer  | Important limitations or caveats                                 | “The last 100 records may contain selection bias.” |
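
The four minimum elements can also be kept as structured fields, which makes it easy to spot an empty section before publishing. The following is a minimal sketch, not a standard tool: the `TermsOfUse` class and its method names are hypothetical and chosen only for illustration, and the example values are the ones from the table.

```python
from dataclasses import dataclass


@dataclass
class TermsOfUse:
    """The four minimum elements of a statement (hypothetical field names)."""

    description: str
    license: str
    attribution: str
    disclaimer: str

    def missing_sections(self) -> list[str]:
        """Return the names of sections that are empty or whitespace only."""
        return [name for name, value in vars(self).items() if not value.strip()]

    def render(self) -> str:
        """Render the statement as plain text, one labelled section per paragraph."""
        return "\n\n".join(
            f"{name.capitalize()}\n{value.strip()}"
            for name, value in vars(self).items()
        )


terms = TermsOfUse(
    description="These terms apply to the Happy Dataset.",
    license="The Happy Dataset is in the public domain.",
    attribution="Please cite the Happy Dataset.",
    disclaimer="The last 100 records may contain selection bias.",
)
print(terms.missing_sections())  # prints []
print(terms.render())
```

A check like this only confirms that each section has some text. Whether the wording is legally adequate still needs human review.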

Depending on the context, the statement may need additional clauses for multiple databases, sensitive data, embargoes, or obligations coming from a larger funded project.

An example of a broader policy framework is the FAIRsharing record for the 1958 Birth Cohort policy:

What format should terms of use take?


Terms of use should be stored as plain text in a machine-friendly format such as .txt, .md, or .html. The exact length and level of detail will vary by project, but the statement should be easy for people to read and for systems to preserve.
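
As a rough illustration of storing the statement in one of these formats, the sketch below writes the text to a UTF-8 file and rejects extensions other than the three named above. The function name `save_terms` and the default file name `TERMS.md` are assumptions made for this example, not a convention required by any repository.

```python
import tempfile
from pathlib import Path

# Extensions named in the text above as plain-text, machine-friendly formats.
ALLOWED_SUFFIXES = {".txt", ".md", ".html"}


def save_terms(text: str, folder: Path, filename: str = "TERMS.md") -> Path:
    """Write the statement as UTF-8 text and return the path of the new file."""
    path = folder / filename
    if path.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"{path.suffix or 'no extension'} is not a plain-text format")
    path.write_text(text, encoding="utf-8")
    return path


# A temporary folder stands in for the real project folder in this example.
with tempfile.TemporaryDirectory() as tmp:
    saved = save_terms("These terms apply to the Happy Dataset.\n", Path(tmp))
    print(saved.name)  # prints TERMS.md
```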

Callout

Keep the statement in an accessible text format

You can draft terms of use in almost any editor, but the final version should be stored in a format that does not depend on proprietary software to read it.

Many projects place the statement in a README, LICENSE, or similar documentation file.

Challenge

Exercise

Visit the City of Philadelphia terms-of-use file:

https://github.com/CityOfPhiladelphia/terms-of-use/blob/master/LICENSE.md

Answer the following:

  • What kind of resource is it about?
  • In what format is the statement written?
  • On which platform is it published?

  • It is a terms-of-use style statement published for city-maintained digital resources.
  • The file format is Markdown (.md).
  • It is published on GitHub.

The statement itself often lives next to the data documentation:

Figure: Folder layout showing terms of use documentation

Some repositories let you define tailored reuse conditions directly on the platform. Dataverse, for example, defaults to a CC0 waiver but also allows custom terms after dataset creation.

Figure: Terms and license settings in a repository interface

Challenge

Exercise

Is it possible to edit terms of use in DataverseNL?

Check the DataverseNL documentation or FAQ and find the answer.

Yes. After creating a dataset, you can go to the Terms tab, choose not to apply CC0, and provide your own terms and conditions.

Useful reference:

Are there standard licenses we can pick from?


Two commonly used licensing families for data are:

  • Creative Commons (CC)
  • Open Data Commons (ODC)

Creative Commons licenses are easy to understand and widely recognized, even if they were not designed only for data. Open Data Commons licenses are more explicitly data-oriented.

| Mark | Meaning                                                   |
|------|-----------------------------------------------------------|
| BY   | Creator must be credited                                  |
| SA   | Derivatives or redistributions must use the same license  |
| NC   | Only non-commercial uses are allowed                      |
| ND   | No derivatives are allowed                                |

Challenge

Exercise

Choose a Creative Commons license for simulation data under these conditions:

  • others cannot modify the work
  • commercial reuse is allowed

Which license fits?

Attribution-NoDerivatives 4.0 International, or CC BY-ND 4.0.
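
The reasoning behind this answer can be sketched as a small function that assembles the license code from the marks in the table. This is an illustrative helper only: the name `cc_license` and its parameters are hypothetical, it covers just the six BY-based Creative Commons licenses (not the CC0 waiver), and it assumes version 4.0 by default.

```python
def cc_license(
    allow_commercial: bool,
    allow_derivatives: bool,
    share_alike: bool = False,
    version: str = "4.0",
) -> str:
    """Build a Creative Commons license code such as 'CC BY-ND 4.0'."""
    if share_alike and not allow_derivatives:
        # ShareAlike constrains derivatives, so it cannot be combined with ND.
        raise ValueError("SA and ND cannot be combined")
    marks = ["BY"]  # attribution is part of every license handled here
    if not allow_commercial:
        marks.append("NC")
    if not allow_derivatives:
        marks.append("ND")
    elif share_alike:
        marks.append("SA")
    return f"CC {'-'.join(marks)} {version}"


# The exercise above: no modifications, commercial reuse allowed.
print(cc_license(allow_commercial=True, allow_derivatives=False))  # prints CC BY-ND 4.0
```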

Discussion

Scenario

You are collaborating on a study of quality of life in children and plan to collect potentially sensitive information about bullying, social media, and family structure.

What should be considered when drafting terms of use for this study? Should the statement be drafted only by the researchers, or should legal and governance support be involved?

Key Points

  • A data terms of use statement defines the legal and practical basis for reuse.
  • A license is the minimum requirement, but some projects need richer terms or a custom agreement.
  • Store terms of use in an accessible text format such as .md or .txt.
  • If a standard license does not fit the project, a tailored terms-of-use statement or usage agreement may be necessary.