Corpus

An indexed collection of source information objects over which retrieval operates — the working document set behind a pipeline, distinct from the documents slice's bibliographic gmeow:Collection (a Work). Members via gmeow:corpusMember; content-addressable via the EXISTING gmeow:contentDigest, so an index can prove WHAT it indexed.

Structure

Subclass of: gmeow:InformationObject

Practical Pattern

Use gmeow:Corpus as a specialized kind of gmeow:InformationObject. Add statement metadata or a standpoint when the assertion needs provenance, confidence, or vantage.

Example Snippets

These snippets are generated from canonical slice examples and trimmed to the Turtle blocks where this term appears.

Lillith Pipeline

ex:corpus-lillith a gmeow:Corpus ;
    rdfs:label "Lillith working corpus"@en ;
    gmeow:corpusMember ex:mail-archive ;
    gmeow:contentDigest "blake3:0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef" .

Common Companion Terms

gmeow:InformationObject

Projects To

Profile External Targets
dcat dcat

External Equivalences

Equivalent or closely aligned targets: schema, wd

Linkages

Generated from the canonical mapping DSL. SSSOM files are the generated public interchange form for term equivalences.

Term Equivalences

Source Kind Profile Predicate/Relation Target Evidence
gmeow:Corpus equivalence - skos:closeMatch schema:Dataset gmeow-graphrag.sssom.tsv; gmeow:eqGr002; confidence 0.7
gmeow:Corpus equivalence - skos:closeMatch wd:Q461183 gmeow-graphrag.sssom.tsv; gmeow:eqGr001; confidence 0.8

Projection Coverage

Source Kind Profile Predicate/Relation Target Evidence
gmeow:Corpus projection dcat projects to / <= dcat:Dataset gmeow:mapDcatCorpus; confidence 0.85; lossy: the corpus's retrieval role (index membership, embedding lineage) is invisible to DCAT — only the dataset facet survives
Published Alignment Graph

Alignments

RelationTarget
closeMatchschema:Dataset
closeMatchwd:Q461183