New Paper in Scientific Data: How CEDAR Enables Standardized Metadata Across HuBMAP—and Now SenNet

We’re pleased to announce the publication of a new paper in Scientific Data titled
“Ensuring Adherence to Standards in Experiment-Related Metadata Entered Via Spreadsheets”
(Read the paper).

This paper outlines how the CEDAR Workbench has played a central role in ensuring that experiment-related metadata collected by the HuBMAP consortium adheres to FAIR principles and community standards—despite being entered by researchers using spreadsheets. It also describes the expansion of this approach into the SenNet program, demonstrating how metadata consistency and quality can scale across large, decentralized biomedical research consortia.

The Challenge: Researcher-Friendly Interfaces with Standards-Driven Metadata

Biomedical consortia like HuBMAP face a major challenge: researchers need to enter metadata that is rich, precise, and standards-compliant—but they overwhelmingly prefer working in spreadsheets. This poses a tension between usability and data quality.

Rather than force researchers to abandon familiar tools, the CEDAR team collaborated with HuBMAP to design a solution that lets researchers continue using spreadsheets while ensuring the underlying metadata adheres to formal standards.

CEDAR-Powered Metadata Management in HuBMAP

The paper details how HuBMAP used the CEDAR Workbench to create more than 30 structured metadata templates covering biospecimen collection, assays, and tissue processing. These templates are:

Backed by community ontologies like OBI, UBERON, CL, and EDAM
Semantically explicit, with field-level constraints, validation, and ontology term selection
Output in JSON-LD, enabling downstream integration and discovery

CEDAR’s infrastructure then generates annotated spreadsheet templates from these metadata forms. Researchers fill these out in Excel or Google Sheets, but each cell corresponds to a well-defined field from the CEDAR metadata model.

Once filled, the spreadsheets are programmatically validated and converted back into structured JSON-LD via the CEDAR API—ensuring data integrity while accommodating researchers’ workflow preferences.

Integration Across HuBMAP

The result is a full-stack metadata ecosystem where:

Templates are collaboratively authored in CEDAR
Researchers fill out intuitive spreadsheets
Data managers validate and ingest the metadata automatically
Outputs are standards-aligned and machine-actionable

This approach is now operational across HuBMAP’s 60+ data contributors, ensuring consistency without sacrificing usability.

Scaling to SenNet

The SenNet consortium, which focuses on senescent cell profiling across human tissues, is now adopting this same strategy. CEDAR-based templates have been adapted for SenNet’s unique metadata needs, and the spreadsheet-based entry pipeline is being reused to accelerate onboarding and minimize errors.

The paper outlines how early SenNet metadata—also entered via spreadsheets—is already conforming to ontology-backed, reusable formats via CEDAR’s validation and transformation tools.

A Practical FAIR Metadata Blueprint

This work shows that it’s possible to marry the simplicity of spreadsheet entry with the rigor of standards-based metadata. By placing CEDAR at the center of this pipeline, both HuBMAP and SenNet have made structured metadata entry accessible, scalable, and FAIR-compliant.

For other research networks grappling with metadata chaos, this publication offers a compelling blueprint: standards adherence and ease-of-use don’t have to be tradeoffs.

The Challenge: Researcher-Friendly Interfaces with Standards-Driven Metadata

CEDAR-Powered Metadata Management in HuBMAP

Integration Across HuBMAP

Scaling to SenNet

A Practical FAIR Metadata Blueprint

You might also like