Open Science Catalog (OSC)

Introduction

The Open Science Catalog (OSC) is a key component of the ESA EO Open Science framework. It is built on the SpatioTemporal Asset Catalog (STAC) specification, a standardised format for describing geospatial data. The catalog captures information about Projects, Products, Workflows, and Experiments, and their relationships to ESA Themes, Variables, and EO Missions. These elements contain information about, and direct links to, the corresponding research outcomes, which are themselves stored with external storage providers.

Users can browse and explore these interlinked elements through the web browser, the API, or directly through the data itself. See Data Discovery and Access for more information.
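Exploring "directly through the data" amounts to following the links stored in each JSON file. The following is a minimal sketch, assuming the standard STAC link relations `child` and `item`; the function name `child_links` and the hand-written example catalog are illustrative and not part of the OSC.

```python
import json


def child_links(catalog: dict) -> list[tuple[str, str]]:
    """Return (title, href) pairs for the child and item links of one catalog."""
    pairs = []
    for link in catalog.get("links", []):
        if link.get("rel") in ("child", "item"):
            pairs.append((link.get("title", ""), link["href"]))
    return pairs


# A tiny hand-written catalog standing in for a real catalog.json file.
example = json.loads("""
{
  "type": "Catalog",
  "id": "example",
  "links": [
    {"rel": "self", "href": "./catalog.json"},
    {"rel": "child", "href": "./projects/catalog.json", "title": "Projects"},
    {"rel": "child", "href": "./products/catalog.json", "title": "Products"}
  ]
}
""")

for title, href in child_links(example):
    print(title, href)
```

With a local clone of the metadata repository, the same function could be applied to any catalog file loaded with `json.load`, and each returned href resolved relative to that file.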

Adding / updating entries

The different ways to add/update/remove entries from the catalog are described in the Contributing to the OSC section.

At a core level, each update to metadata is handled via a Pull Request (PR).

The Pull Request lets reviewers see the proposed changes in advance, check their validity (via an automated validation script), and provide reviews as comments.
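The validation script itself is not described here. As a rough illustration of the kind of check such a script might run on a project entry, here is a minimal sketch; the function name `check_project` and the list of required fields are assumptions, and the real validation rules may differ.

```python
# Hypothetical set of fields to check; the real validation rules may differ.
REQUIRED_PROJECT_FIELDS = ("id", "title", "description", "osc:status", "links")
# The two status values listed in the Projects table.
ALLOWED_STATUS = {"ongoing", "completed"}


def check_project(collection: dict) -> list[str]:
    """Return a list of human-readable problems found in a project entry."""
    problems = []
    for field in REQUIRED_PROJECT_FIELDS:
        if field not in collection:
            problems.append(f"missing field: {field}")
    status = collection.get("osc:status")
    if status is not None and status not in ALLOWED_STATUS:
        problems.append(f"unexpected osc:status: {status!r}")
    return problems
```

Running a check like this locally before opening a Pull Request can catch simple omissions early, but it does not replace the automated validation applied to the Pull Request.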

Structure

The Open Science Catalog is a deployment of several EOEPCA components, combined with additional supplementary components. This section focuses on the metadata and its structure, as that is what most users will need to work with. You can see the full technical architecture of the Open Science Catalog here.

The Open Science Catalog metadata is a STAC catalog composed of JSON files, whose attributes and structure together describe its elements: Themes, Variables, EO Missions, Projects, Products, Workflows, and Experiments. All files are stored directly on GitHub [here](https://github.com/ESA-EarthCODE/open-science-catalog-metadata/tree/main), because they contain only metadata and links, not the actual data in the products or the code in the workflows.

Detailed information about Projects, Products, Workflows, and Experiments is available in the tutorial that shows how to manually create files for the OSC - here.

Projects

Projects are the containers that hold the top-level information about your work. A project is the first type of information you should provide. Typically, an OSC project corresponds to a project financed by the European Space Agency Earth Observation programme. Before creating a new project, check whether it is already on the list of onboarded projects. If it is, use the existing entry and update it only where needed.

| Field | Description | STAC representation |
| --- | --- | --- |
| Project_ID | Numeric identifier | |
| Status | “ongoing” or “completed” | osc:status property |
| Project_Name | Name | title property |
| Short_Description | | description property |
| Website | | link |
| Eo4Society_link | | link |
| Consortium | | contacts[].name property |
| Start_Date_Project | | extent.temporal[] property |
| End_Date_Project | | extent.temporal[] property |
| TO | | contacts[].name property |
| TO_E-mail | | contacts[].emails[].value property |
| Theme1 - Theme6 | Theme identifiers | osc:themes property |

The metadata of each project is stored in a folder named after the project's unique id (collectionid). Each folder contains a single file, collection.json, which holds all the project information (metadata).
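To make the mapping from the table to a collection.json file more concrete, the sketch below assembles a project entry as a plain Python dict. It assumes the standard STAC Collection layout (for example `extent.temporal.interval`); the function name `build_project`, the example values, the role names on the contacts, and the placeholder `license` are assumptions for illustration only.

```python
import json


def build_project(project_id, title, description, status, start, end,
                  themes, to_name, to_email, consortium):
    """Assemble a project entry as a plain dict shaped like a STAC Collection."""
    # TO and TO_E-mail map to contacts[].name and contacts[].emails[].value.
    contacts = [{"name": to_name,
                 "emails": [{"value": to_email}],
                 "roles": ["technical_officer"]}]  # role name is an assumption
    # Each consortium member becomes one more contacts[].name entry.
    contacts += [{"name": name, "roles": ["consortium_member"]}
                 for name in consortium]
    return {
        "type": "Collection",
        "stac_version": "1.0.0",
        "id": project_id,
        "title": title,                 # Project_Name
        "description": description,     # Short_Description
        "osc:status": status,           # "ongoing" or "completed"
        "osc:themes": themes,           # Theme1 - Theme6
        "extent": {
            "spatial": {"bbox": [[-180, -90, 180, 90]]},
            # Start_Date_Project and End_Date_Project
            "temporal": {"interval": [[start, end]]},
        },
        "contacts": contacts,
        "license": "proprietary",       # placeholder, not taken from the table
        "links": [],                    # Website, Eo4Society_link, catalog links
    }


project = build_project(
    "my-project", "My Project", "Short description.", "ongoing",
    "2023-01-01T00:00:00Z", "2025-12-31T23:59:59Z",
    ["land"], "Jane Doe", "jane.doe@example.com", ["Example Institute"],
)
print(json.dumps(project, indent=2))
```

The printed JSON is what would be saved as collection.json inside the folder named after the project id, once the links have been filled in.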

In addition to specifying the links within the project's collection.json entry (created above), you should also add an entry to the parent catalog, which lists all projects, so that the project is rendered correctly in STAC Browser.
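Adding the entry to the parent catalog can be sketched as appending one `child` link. This assumes the parent catalog sits one level above the project folders, so the relative href is `./<id>/collection.json`; the function name `add_child_link` is hypothetical.

```python
def add_child_link(catalog: dict, project_id: str, title: str) -> dict:
    """Add a child link for a project to the parent catalog, if not present."""
    # Assumed layout: <parent folder>/catalog.json next to <project id>/collection.json
    href = f"./{project_id}/collection.json"
    links = catalog.setdefault("links", [])
    # Skip the append when the link already exists, so re-running is harmless.
    if not any(link.get("href") == href for link in links):
        links.append({
            "rel": "child",
            "href": href,
            "type": "application/json",
            "title": title,
        })
    return catalog
```

Making the function idempotent means an update to an existing project does not create duplicate entries in the parent catalog.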

Products

Products represent the outputs of your projects and typically reference datasets. Like Projects, they are STAC entries stored in a collection.json file and follow a similar structure, with some additional fields that improve their findability.

| Field | Description | STAC representation |
| --- | --- | --- |
| ID | Numeric identifier | |
| Status | “ongoing” or “completed” | osc:status property |
| Project | The project identifier | osc:project property, collection link |
| Website | | link |
| Product | Name | link |
| Short_Name | | identifier |
| Description | | description property |
| Access | URL | link |
| Documentation | URL | link |
| Version | | version property |
| DOI | Digital Object Identifier | sci:doi property and cite-as link |
| Variable | Variable identifier | collection link |
| Start | | extent.temporal[] |
| End | | extent.temporal[] |
| Region | | osc:region property |
| Polygon | | geometry |
| Released | | created property |
| Theme1 - Theme6 | Theme identifiers | osc:themes property |
| EO_Missions | Semi-colon separated list of missions | osc:missions property |
| Standard_Name | | cf:parameter.name property |

In addition to specifying the links from the product to other parts of the catalog, you must also add the reverse links, as with Projects:

  • From the Product Collection.json to the Catalog.json (listing all products in the OSC)
  • From the associated Project to the Product
  • From the associated EO-Missions catalog to the Product
  • From the associated Variables Catalog to the Product
  • From the associated Themes Catalog to the Product
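Because several files have to be touched for each product, it can help to generate a checklist of them. The sketch below does this under an assumed folder layout (`products/`, `projects/`, `eo-missions/`, `variables/`, `themes/`); the paths, the function name `reverse_link_files`, and the mission and variable ids in the usage example are illustrative assumptions, so check them against the actual repository before relying on them.

```python
def reverse_link_files(project_id, missions, variables, themes):
    """List the catalog files that need a link pointing back to a product."""
    # Catalog listing all products, then the associated project.
    files = ["products/catalog.json",
             f"projects/{project_id}/collection.json"]
    # One file per associated EO mission, variable, and theme.
    files += [f"eo-missions/{m}/catalog.json" for m in missions]
    files += [f"variables/{v}/catalog.json" for v in variables]
    files += [f"themes/{t}/catalog.json" for t in themes]
    return files


# Hypothetical identifiers, used only to show the shape of the result.
for path in reverse_link_files("worldcereal2", ["sentinel-2"],
                               ["crop-extent"], ["land"]):
    print(path)
```

Each file in the list then needs a link whose href points at the product's collection.json.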

Workflows

Workflows are the code and processing steps associated with a project that have been used to generate a specific product. Unlike OSC Project and Product entries, Workflows follow the OGC API Records specification. The metadata of a workflow is nevertheless also expressed in JSON.

| Field Name | Description |
| --- | --- |
| conformsTo | An array of URIs indicating which OGC API Records specifications this record conforms to. |
| type | Indicates the GeoJSON object type. Required to be "Feature" for OGC compliance. |
| geometry | Spatial representation of the item. Set to null here, as the workflow may not be spatially explicit. |
| linkTemplates | An array of link templates as per the OGC API. Used for dynamic link generation. |
| id | Unique identifier for the workflow record ('worldcereal-workflow2'). |
| links | List of external and internal references, including catalog navigation, project association, theme association, process graph, source code, and service endpoint. |
| properties.contacts | List of individuals or organizations associated with the workflow. Each contact may include name, email, and roles such as technical_officer or consortium_member. |
| properties.created | Timestamp representing when the workflow was first created (2025-07-14T18:02:13Z). |
| properties.updated | Timestamp of the most recent update to the workflow (2025-07-14T18:02:13Z). |
| properties.version | The version number of the workflow (1). |
| properties.title | A concise, descriptive title of the workflow: "ESA worldcereal global crop extent detector2". |
| properties.description | A summary of what the workflow does: "Detects crop land at 10m resolution, trained for global use...". |
| properties.keywords | Array of keywords to support discoverability (e.g., agriculture, crops). |
| properties.themes | Array of themes the workflow relates to. Each entry includes a concepts array with IDs (e.g., 'land') and a scheme URL. |
| properties.formats | Output formats of the workflow (e.g., GeoTIFF). |
| properties.osc:project | Project ID associated with the workflow (worldcereal2). |
| properties.osc:status | Current status of the workflow (e.g., completed). |
| properties.osc:type | Type of OSC object, expected to be workflow. |
| properties.license | License for the workflow (the example record has 'varuious', likely a typo for 'various'). |
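The fields above can be put together into a record skeleton as follows. This is a minimal sketch using the example values from the table; the function name `build_workflow_record`, the `conformsTo` URI, the theme scheme URL, and the object shape used for `formats` are assumptions that should be checked against an existing workflow record.

```python
import json

# Assumed conformance URI for the OGC API Records core record; verify before use.
RECORD_CORE = "http://www.opengis.net/spec/ogcapi-records-1/1.0/req/record-core"


def build_workflow_record(workflow_id, title, description, project_id,
                          created, theme_ids, theme_scheme,
                          keywords=(), formats=(), version="1"):
    """Assemble a workflow entry as a GeoJSON Feature-style record."""
    return {
        "conformsTo": [RECORD_CORE],
        "type": "Feature",          # required GeoJSON object type
        "id": workflow_id,
        "geometry": None,           # serialised as null in JSON
        "linkTemplates": [],
        "links": [],                # catalog, project, theme, code links go here
        "properties": {
            "title": title,
            "description": description,
            "created": created,
            "updated": created,     # same as created until the first update
            "version": version,
            "keywords": list(keywords),
            "themes": [{"scheme": theme_scheme,
                        "concepts": [{"id": t} for t in theme_ids]}],
            "formats": [{"name": f} for f in formats],
            "contacts": [],
            "osc:project": project_id,
            "osc:status": "completed",
            "osc:type": "workflow",
            "license": "various",
        },
    }


record = build_workflow_record(
    "worldcereal-workflow2",
    "ESA worldcereal global crop extent detector2",
    "Detects crop land at 10m resolution, trained for global use...",
    "worldcereal2",
    "2025-07-14T18:02:13Z",
    ["land"],
    "https://example.com/themes",   # hypothetical scheme URL
    keywords=["agriculture", "crops"],
    formats=["GeoTIFF"],
)
print(json.dumps(record, indent=2))
```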

In addition to specifying the links from the workflow to other parts of the catalog, it is required to add the reverse links:

  • From the Workflow record.json to the workflows/catalog.json (listing all workflows in the OSC)
  • From the associated Project to the Workflow
  • From the associated Themes to the Workflow

Themes

Themes describe the Earth Science topics linked to the grand science challenges set in the ESA strategy. This is a fixed list.

| Field | Description | STAC representation |
| --- | --- | --- |
| theme | Theme name | id |
| description | Theme description | description |
| link | Link to further resources | link |

Variables

Variables describe the geoscience, climate, and environmental variables that products and workflows model. There is a fixed list of variables; however, if your variable is missing from it, you can add it in your Pull Request.

| Field | Description | STAC representation |
| --- | --- | --- |
| theme | The associated theme name | osc:theme property |
| theme_description | | |
| link | Link to further resources | link |
| variable | The variable name | id |
| domain | The variables domain | |
| variable description | | description |

ESA – European Space Agency © 2020-2025