A curated list of awesome stuff around the FAIR principles for (scientific) data, i.e that data is findable, accessable, interoperable and re-usable.
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me related semantics lists resources from awesome-fair"
Installation instructions →What's inside
Provenance tracking
- AiiDA
Automated Interactive Infrastructure and Database for Computational Science (AiiDA) to automatically track provenance of simulation workflows and all associated data,
- awesome-pipelineRelated workflow tools lists
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin.
- Awesome workflow enginesRelated workflow tools lists
Curated list of awesome open source workflow engines.
- Computational Data Analysis Workflow SystemsRelated workflow tools lists
A list of existing workflow systems.
- CWL
Domain-agnostic and community-driven open standard for description and execution of research workflows that supports provenance tracking (
- DataLad
A free and open-source distributed data management system for everyone. It is based on git-annex with manual to automatic provenance tracking,
Related lists
- Awesome Curated Tools
A curated list of digital tools we use, ranging from accounting and data science to scientific research and liquid democracy.
- Awesome-open-climate-science
An open science related list specific to the domain of Atmospheric, Ocean, and Climate science.
- Awesome-open-science-software
A list of open science resources and software.
- awesome-research-software-registies
Awesome list for where one can register or upload research software.
- awesome-rse
An awesome list by HIFIS collecting information about research software engineering, touching FAIRness and sustainability.
- awesome-rse-policies
An awesome list by HIFIS collecting information about research software engineering policies, touching FAIRness and sustainability.
Ontology services
- awesome-ontologyRelated semantics lists
A curated list of ontology things.
- awesome-semantic-toolsRelated semantics lists
List of projects related to Ontology engineering and Semantic Web technologies.
- Ontobee
A linked ontology data server to support ontology term dereferencing, linkage, query and integration. See also this
- Ontology Lookup Service
OLS is a repository for biomedical ontologies that aims to provide a single point of access to the latest ontology versions.
Resources about the FAIR principles
- Barend Mons article in Nature 578, 491 (2020)
Proposition to invest 5% of research funds in ensuring data are reusable.
- Cost of not having FAIR research data
A 2018 European Commission Cost-benefit analysis for FAIR research data (Written by PwC EU Services).
- DataPLANT ARC Tool TalkFAIR Digital Object and related projects
NFDI4plants interpretation of the FDO based on GitHub repository and RO Crate.
- DONA Suggested ReadingFAIR Digital Object and related projects
The history of the Digital Object Architecture (DOA) back into the 80s.
- FAIR Digital Object FrameworkFAIR Digital Object and related projects
A WIP specification for an FDO infrastructure based on linked data / RDF.
- FAIR Digital Objects ForumFAIR Digital Object and related projects
General platform for discussions on the advancement and development of FAIR Digital Objects.
Software and software publications
- Citable code with Zenodo & GitHub
Make GitHub repositories citable with Zenodo DOI.
- CITATION.CFF
Plain text files with human- and machine-readable citation information for software (and datasets). Supported by GitHub, Zenodo, Zotero.
- CodeMeta
Minimal metadata schema for science software and code, in JSON and XML to create a concept vocabulary that can be used to standardize the exchange of software metadata across repositories and organizations.
- fossology
Open source license compliance software system and toolkit. You can run license, copyright and export control scans from the command line.
- HERMES
A CI based workflow to create and publish software publications to well known repositories.
- SOMEF
Extract software publication metadata from README and other docs automatically using ML and other techniques to reduce the amount of boilerplate work for the developer.
Awesome meta data sources
- CrossRef
Organization building connections between related entities, building a queryable graph.
- Microsoft academy graph
All the data and links from Mircosoft academy (shutdown end of 2021).
- Openaire graph
All metadata contained in the openaire graph.
- Scholix
A schema for scholarly links. Implemented and deployed by several scholarly link providers.
Metadata formats and standards
- Data Catalog (DCAT)
RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web.
- DataCite
Metadata schema developed by international community with increasing adoption by repositories.
- Dublin Core Metadata Initiative Terms
Dublin Core Metadata Element Set, is a set of fifteen "core" elements for describing resources.
- HDO - Helmholtz Digitization Ontology
HMC developed ontology according to OBO principles with covering concept of digital resources and data management.
- JSON LD Playground
Convert JSON-LD data between various representations.
- JSON Schema
Standard for the description of structural constraints in order to do validation of JSON objects.
Finding datasets and software
- Datacite commons
Search through the metadata indexed by Datacite.
- EuDat B2find
Search through metadata of datasets accumulated by EuDat.
- Microsoft academy
Mircosoft academy search through a pid graph created by microsoft (shutdown end of 2021).
- OpenAIRE explorer
Search through the metadata indexed by openaire.
- Research Software Repository
Aggregates research software from various sources with information about the problem it solves and its scientific domain.
- Schole explorer
A data literature interlinking service (former scholix), indexes links between data and journal publications. It also provides interfaces and APIs to query the graph.
Showing a sample of 68 resources. View the full list on GitHub →