Dataset
- CURIE:
gmeow:Dataset - IRI: https://blackcatinformatics.ca/gmeow/Dataset
- Category: class
- Defined by:
gmeow:slices/documents - Box roles: TBox role (What is this?)
A collection of data published as a unit. A specialization of gmeow:Work.
Structure
Subclass of: gmeow:Work
Practical Pattern
Use gmeow:Dataset as a specialized kind of gmeow:Work. Add statement metadata or a standpoint when the assertion needs provenance, confidence, or vantage.
Example Snippets
These snippets are generated from canonical slice examples and trimmed to the Turtle blocks where this term appears.
Lillith Dataset
- Source:
slices/extensions/graphrag/examples/lillith-dataset.ttl - Examples catalog: open in catalog#example-slices-extensions-graphrag-examples-lillith-dataset
# SPDX-FileCopyrightText: 2026 Blackcat Informatics® Inc. <paudley@blackcatinformatics.ca>
# SPDX-License-Identifier: CC-BY-4.0
#
# The dataset descriptor for the Lillith worked example: the
# gmeow:Dataset node that the research-object exports (Croissant, RO-Crate,
# DCAT, DataCite, Frictionless) read their catalog metadata FROM — title,
# description, licence, attribution, publication date. Canonical instance
# data; every export is a generated lossy projection of it (P4/P5).
@prefix gmeow: <https://blackcatinformatics.ca/gmeow/> .
@prefix ex: <https://blackcatinformatics.ca/gmeow/examples/graphrag/> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
ex:lillith-benchmark a gmeow:Dataset ;
rdfs:label "Lillith GraphRAG benchmark"@en ;
gmeow:title "Lillith GraphRAG benchmark"@en ;
gmeow:description "A worked GraphRAG benchmark dataset: a content-addressed corpus, its chunking, embeddings, vector index, retrieval events, and model-extracted entity/relationship descriptions — every artifact attributed and confidence-weighted, published as a research object."@en ;
gmeow:hasPart ex:corpus-lillith ;
gmeow:hasLicense ex:lillith-license ;
gmeow:wasAttributedTo ex:blackcat ;
gmeow:datePublished "2026-06-12T00:00:00Z"^^xsd:dateTime ;
gmeow:sourceLocation "https://blackcatinformatics.ca/gmeow/examples/graphrag/lillith-benchmark" .
Data Pipeline
- Source:
slices/extensions/procedures/examples/data-pipeline.ttl - Examples catalog: open in catalog#example-slices-extensions-procedures-examples-data-pipeline
ex:rawEvents a gmeow:Dataset ; gmeow:title "Raw event log"@en .
Common Companion Terms
Projects To
| Profile | External Targets |
|---|---|
dcat |
dcat, dcterms, rdf |
schema-org |
schema |
External Equivalences
Equivalent or closely aligned targets: dcmitype, qb, schema, wd
Linkages
Generated from the canonical mapping DSL. SSSOM files are the generated public interchange form for term equivalences.
Term Equivalences
| Source | Kind | Profile | Predicate/Relation | Target | Evidence |
|---|---|---|---|---|---|
gmeow:Dataset |
equivalence | - |
skos:closeMatch | dcmitype:Dataset | gmeow-dublin-core.sssom.tsv; gmeow:eqDcType002; confidence 0.9 |
gmeow:Dataset |
equivalence | - |
skos:closeMatch | qb:DataSet | gmeow-aggregation.sssom.tsv; gmeow:eqAgg002; confidence 0.9 |
gmeow:Dataset |
equivalence | - |
owl:equivalentClass | schema:Dataset | gmeow-classes.sssom.tsv; gmeow:eqClasses023; confidence 1 |
gmeow:Dataset |
equivalence | - |
skos:closeMatch | wd:Q1172284 | gmeow-wikidata.sssom.tsv; gmeow:eqWikidata034; confidence 0.85 |
Projection Coverage
| Source | Kind | Profile | Predicate/Relation | Target | Evidence |
|---|---|---|---|---|---|
gmeow:Dataset |
projection | dcat |
projects to / = | dcat:Dataset | gmeow:mapDcatDataset; confidence 0.95 |
gmeow:Dataset |
projection | dcat |
projects to / <= | dcat:Distribution, dcat:distribution, dcat:downloadURL, rdf:type | gmeow:mapDcatPartDistribution; confidence 0.8; lossy: generic parthood narrows to the dataset-distribution reading; non-distribution parts are over-typed |
gmeow:Dataset |
projection | dcat |
projects to / = | dcterms:description | gmeow:mapDcatDescription; confidence 0.9 |
gmeow:Dataset |
projection | dcat |
projects to / = | dcterms:issued | gmeow:mapDcatIssued; confidence 0.9 |
gmeow:Dataset |
projection | dcat |
projects to / <= | dcterms:license | gmeow:mapDcatLicense; confidence 0.9; lossy: the License agreement's deontic structure (Permission/Prohibition/Duty rules) collapses to a flat license edge |
gmeow:Dataset |
projection | dcat |
projects to / = | dcterms:title | gmeow:mapDcatTitle; confidence 0.9 |
gmeow:Dataset |
projection | schema-org |
projects to / = | schema:Dataset | gmeow:mapSchemaDataset; confidence 0.95 |
Usage Advice
Use when
- Use for a data collection published as a citable unit — a research dataset, statistical release, or open-data corpus at the
gmeow:Worktier.
Avoid when
- Avoid it for an ongoing ordered stream of feed items (use
gmeow:DataFeed) and avoid it for a single downloadable file of the dataset (that is agmeow:Manifestation/ distribution).
How to use
- Type the dataset
gmeow:Dataset, carry its DOI ongmeow:identifierandgmeow:hasTitle/ bibliographic dates; expose concrete download files asgmeow:Manifestationcarriers bearinggmeow:mediaType.
Examples
- ex:census a
gmeow:Dataset;gmeow:hasTitleex:censusTitle;gmeow:identifier"doi:10.5061/dryad.xyz".
Published Alignment Graph
Alignments
| Relation | Target |
|---|---|
closeMatch | dcmitype:Dataset |
closeMatch | qb:DataSet |
closeMatch | wd:Q1172284 |
equivalentClass | schema:Dataset |