Dataset

A collection of data published as a unit. A specialization of gmeow:Work.

Structure

Subclass of: gmeow:Work

Practical Pattern

Use gmeow:Dataset as a specialized kind of gmeow:Work. Add statement metadata or a standpoint when the assertion needs provenance, confidence, or vantage.

Example Snippets

These snippets are generated from canonical slice examples and trimmed to the Turtle blocks where this term appears.

Lillith Dataset

# SPDX-FileCopyrightText: 2026 Blackcat Informatics® Inc. <paudley@blackcatinformatics.ca>
# SPDX-License-Identifier: CC-BY-4.0
#
# The dataset descriptor for the Lillith worked example: the
# gmeow:Dataset node that the research-object exports (Croissant, RO-Crate,
# DCAT, DataCite, Frictionless) read their catalog metadata FROM — title,
# description, licence, attribution, publication date. Canonical instance
# data; every export is a generated lossy projection of it (P4/P5).
@prefix gmeow: <https://blackcatinformatics.ca/gmeow/> .
@prefix ex:    <https://blackcatinformatics.ca/gmeow/examples/graphrag/> .
@prefix rdfs:  <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd:   <http://www.w3.org/2001/XMLSchema#> .

ex:lillith-benchmark a gmeow:Dataset ;
    rdfs:label "Lillith GraphRAG benchmark"@en ;
    gmeow:title "Lillith GraphRAG benchmark"@en ;
    gmeow:description "A worked GraphRAG benchmark dataset: a content-addressed corpus, its chunking, embeddings, vector index, retrieval events, and model-extracted entity/relationship descriptions — every artifact attributed and confidence-weighted, published as a research object."@en ;
    gmeow:hasPart ex:corpus-lillith ;
    gmeow:hasLicense ex:lillith-license ;
    gmeow:wasAttributedTo ex:blackcat ;
    gmeow:datePublished "2026-06-12T00:00:00Z"^^xsd:dateTime ;
    gmeow:sourceLocation "https://blackcatinformatics.ca/gmeow/examples/graphrag/lillith-benchmark" .

Data Pipeline

ex:rawEvents a gmeow:Dataset ; gmeow:title "Raw event log"@en .

Common Companion Terms

gmeow:Work

Projects To

Profile External Targets
dcat dcat, dcterms, rdf
schema-org schema

External Equivalences

Equivalent or closely aligned targets: dcmitype, qb, schema, wd

Linkages

Generated from the canonical mapping DSL. SSSOM files are the generated public interchange form for term equivalences.

Term Equivalences

Source Kind Profile Predicate/Relation Target Evidence
gmeow:Dataset equivalence - skos:closeMatch dcmitype:Dataset gmeow-dublin-core.sssom.tsv; gmeow:eqDcType002; confidence 0.9
gmeow:Dataset equivalence - skos:closeMatch qb:DataSet gmeow-aggregation.sssom.tsv; gmeow:eqAgg002; confidence 0.9
gmeow:Dataset equivalence - owl:equivalentClass schema:Dataset gmeow-classes.sssom.tsv; gmeow:eqClasses023; confidence 1
gmeow:Dataset equivalence - skos:closeMatch wd:Q1172284 gmeow-wikidata.sssom.tsv; gmeow:eqWikidata034; confidence 0.85

Projection Coverage

Source Kind Profile Predicate/Relation Target Evidence
gmeow:Dataset projection dcat projects to / = dcat:Dataset gmeow:mapDcatDataset; confidence 0.95
gmeow:Dataset projection dcat projects to / <= dcat:Distribution, dcat:distribution, dcat:downloadURL, rdf:type gmeow:mapDcatPartDistribution; confidence 0.8; lossy: generic parthood narrows to the dataset-distribution reading; non-distribution parts are over-typed
gmeow:Dataset projection dcat projects to / = dcterms:description gmeow:mapDcatDescription; confidence 0.9
gmeow:Dataset projection dcat projects to / = dcterms:issued gmeow:mapDcatIssued; confidence 0.9
gmeow:Dataset projection dcat projects to / <= dcterms:license gmeow:mapDcatLicense; confidence 0.9; lossy: the License agreement's deontic structure (Permission/Prohibition/Duty rules) collapses to a flat license edge
gmeow:Dataset projection dcat projects to / = dcterms:title gmeow:mapDcatTitle; confidence 0.9
gmeow:Dataset projection schema-org projects to / = schema:Dataset gmeow:mapSchemaDataset; confidence 0.95

Usage Advice

Use when

Avoid when

How to use

Examples

Published Alignment Graph

Alignments

RelationTarget
closeMatchdcmitype:Dataset
closeMatchqb:DataSet
closeMatchwd:Q1172284
equivalentClassschema:Dataset