Properties and patterns for documenting the internal structure of datasets, including variables, data types, physical layout, and value domains. This repository develops the CDIF data description and integration profile using DDI-CDI (Data Documentation Initiative - Cross Domain Integration) concepts mapped to schema.org JSON-LD.
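As a rough illustration of what a schema.org JSON-LD variable description can look like, the sketch below builds a minimal `Dataset` node with two variables. The schema.org terms (`Dataset`, `variableMeasured`, `PropertyValue`, `unitText`) are real, but the helper functions, the dataset name, and the variable values are hypothetical, and the CDIF profile may require additional or different properties.

```python
import json


def describe_variable(name, description, unit_text=None):
    """Build a schema.org PropertyValue node for one variable (hypothetical helper)."""
    node = {"@type": "PropertyValue", "name": name, "description": description}
    if unit_text is not None:
        # unitText is optional; omit it for unitless variables.
        node["unitText"] = unit_text
    return node


def describe_dataset(name, variables):
    """Wrap variable nodes in a minimal schema.org Dataset (hypothetical helper)."""
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "variableMeasured": variables,
    }


# Illustrative values only; not taken from any file in this repository.
doc = describe_dataset(
    "Example spectrum table",
    [
        describe_variable("energy", "Incident X-ray energy", "eV"),
        describe_variable("mu", "Measured absorption coefficient"),
    ],
)

print(json.dumps(doc, indent=2))
```

Keeping each variable as its own node leaves room to attach further detail later, such as data type or value domain, without changing the dataset-level structure.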
- Documents/CDIF-DescribingDatasetStructure.md — Discussion of how to describe structured data (arrays, hierarchical, dimensional) using DDI-CDI concepts
- psdi-metadata-cdif-implementation.md — How PSDI (Physical Sciences Data Infrastructure) implements CDIF metadata recommendations
Source data files in various formats used for testing DDI-CDI JSON-LD documentation:
- SimpleSample.csv/.jsonld — Simple tabular CSV with CDI metadata
- xdi_example_ss.xdi/nonxafs_2d.xdi — X-ray Data Interchange (XDI) format files
- Therm_6_2.hdf — HDF5 data file
- 20231120_002_1mg_Murchison_Smithsonian.cdf — CDF (Common Data Format) file
- csvTable-DDICDI.jsonld — CSV table documented with DDI-CDI
DDI-CDI metadata instances documenting the example data, in JSON-LD format:
- CDIF-XAS-FullExample.jsonLD — Full XAS (X-ray Absorption Spectroscopy) metadata
- CDIFmin-XAS.jsonLD — Minimal XAS metadata
- ESS11-subset_DDICDI.jsonld — European Social Survey subset
- WeatherObsKeyValue_DDICDI.jsonld — Weather observations in key-value format
- se_na2so4-XDI-CDI-CDIF.jsonld — Sodium selenate XAS with CDI data structure
- HealthResearchdata-WorldFairWP7.json — WorldFAIR health research data
Additional metadata examples organized by domain:
- CDIF2026/ — Current CDIF 2026 schema examples
- FeXAS/ — Iron X-ray absorption spectroscopy
- CMIP-NetCDF/ — Climate model NetCDF metadata
- ESS11/ — European Social Survey
- SDMX examples (merchandise trade statistics, Pacific fisheries)
Long/narrow format data examples (NWIS water quality data) with DDI-CDI metadata demonstrating DescriptorComponent and ReferenceValueComponent roles.
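To make the two roles concrete, here is a minimal sketch of how role assignments could drive a long-to-wide pivot. It assumes the descriptor column names the variable a row reports and the reference value column holds the reported value. The rows, column names, and the `ROLES` mapping are hypothetical (not real NWIS data), and `IdentifierComponent` is used here as an assumed third role for the columns that identify an observation.

```python
from collections import defaultdict

# Hypothetical long-format rows; values are illustrative only.
LONG_ROWS = [
    {"site": "A", "date": "2020-01-01", "parameter": "pH", "value": "7.1"},
    {"site": "A", "date": "2020-01-01", "parameter": "temperature", "value": "4.5"},
    {"site": "B", "date": "2020-01-01", "parameter": "pH", "value": "6.8"},
]

# Assumed role assignment for each column of the long table.
ROLES = {
    "site": "IdentifierComponent",
    "date": "IdentifierComponent",
    "parameter": "DescriptorComponent",
    "value": "ReferenceValueComponent",
}


def columns_with_role(roles, role):
    """Return the column names assigned to the given role, in declaration order."""
    return [column for column, assigned in roles.items() if assigned == role]


def long_to_wide(rows, roles):
    """Pivot long rows into one record per unique identifier combination."""
    id_cols = columns_with_role(roles, "IdentifierComponent")
    # This sketch expects exactly one descriptor and one reference value column.
    (descriptor_col,) = columns_with_role(roles, "DescriptorComponent")
    (value_col,) = columns_with_role(roles, "ReferenceValueComponent")

    wide = defaultdict(dict)
    for row in rows:
        key = tuple(row[column] for column in id_cols)
        # The descriptor value becomes the column name in the wide record.
        wide[key][row[descriptor_col]] = row[value_col]

    return [{**dict(zip(id_cols, key)), **values} for key, values in wide.items()]


wide_rows = long_to_wide(LONG_ROWS, ROLES)
for record in wide_rows:
    print(record)
```

Driving the pivot from the role mapping rather than from hard-coded column names means the same function works for any long table whose metadata declares these roles.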
X-ray absorption spectroscopy metadata examples with various levels of DDI-CDI detail.
Hierarchical JSON data with DDI-CDI metadata (NWIS water quality as JSON structure).
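Describing a hierarchical file starts with knowing which paths exist and what type of value sits at each one. The sketch below walks a parsed JSON value and lists each path with its type. The function name, the path notation, and the sample document are hypothetical, and the sketch assumes all items in an array share the structure of the first item.

```python
import json


def describe_structure(node, path="$"):
    """Yield (path, type_name) pairs for each node in a parsed JSON value."""
    if isinstance(node, dict):
        yield path, "object"
        for key, child in node.items():
            yield from describe_structure(child, f"{path}.{key}")
    elif isinstance(node, list):
        yield path, "array"
        # Assumption: array items are homogeneous, so only the first is inspected.
        if node:
            yield from describe_structure(node[0], f"{path}[*]")
    else:
        # Leaf values are reported by their Python type name (str, float, int, ...).
        yield path, type(node).__name__


# Illustrative document; not taken from the repository's example files.
sample = json.loads(
    '{"site": {"id": "A", "results": [{"parameter": "pH", "value": 7.1}]}}'
)

structure = list(describe_structure(sample))
for item_path, type_name in structure:
    print(f"{item_path}: {type_name}")
```

The resulting path list is only a starting inventory; assigning each path a variable, role, and value domain remains a manual documentation step.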
Python scripts and metadata for reading HDF5 and NetCDF files and extracting structure metadata.
- EC_GeoCodes_encodingFormat.xlsx — Compilation of file formats from EarthCube GeoCodes catalog, scoping the spectrum of data serialization schemes
- SDMX_3-1-0_SECTION_2_FINAL.pdf — SDMX statistical data exchange standard
- PhysicalDataset2025.xmi — UML model for physical dataset structure
- newModel.qea — Enterprise Architect model file
The archive/ directory contains earlier versions of XAS mapping documents, the ADA-CDIF Reader tool, presentation materials, and working documents from the CDIF-for-XAS development.
- cdif-core — CDIF Core profile (base properties)
- Discovery — CDIF Discovery profile (spatial, temporal, variables)
- metadataBuildingBlocks — Building block schemas including CDIFDataDescriptionProfile, cdifDataDescription, cdifTabularData, cdifLongData, cdifDataCube
- validation — Validation tools (JSON Schema, SHACL, framing)
See LICENSE.