Corpus
- CURIE: gmeow:Corpus
- IRI: https://blackcatinformatics.ca/gmeow/Corpus
- Category: class
- Defined by: gmeow:slices/graphrag
- Box roles: TBox role
An indexed collection of source information objects over which retrieval operates: the working document set behind a pipeline. It is distinct from the documents slice's bibliographic gmeow:Collection (a Work). Members are attached via gmeow:corpusMember, and the corpus is content-addressable via the existing gmeow:contentDigest, so an index can prove what it indexed.
Structure
Subclass of: gmeow:InformationObject
Practical Pattern
Use gmeow:Corpus as a specialized kind of gmeow:InformationObject. Add statement metadata or a standpoint when the assertion needs provenance, confidence, or vantage.
Example Snippets
These snippets are generated from canonical slice examples and trimmed to the Turtle blocks where this term appears.
Lillith Pipeline
- Source: slices/extensions/graphrag/examples/lillith-pipeline.ttl
- Examples catalog: open in catalog#example-slices-extensions-graphrag-examples-lillith-pipeline
ex:corpus-lillith a gmeow:Corpus ;
    rdfs:label "Lillith working corpus"@en ;
    gmeow:corpusMember ex:mail-archive ;
    gmeow:contentDigest "blake3:0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef" .
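Because the snippet is trimmed, it carries no prefix declarations and will not parse on its own. The following is a minimal standalone sketch, not the canonical example file. The gmeow: namespace is inferred from the term IRI listed above, the ex: namespace is a placeholder, and the typing and label on ex:mail-archive are illustrative assumptions.

```turtle
@prefix rdfs:  <http://www.w3.org/2000/01/rdf-schema#> .
# Assumption: namespace inferred from the term IRI https://blackcatinformatics.ca/gmeow/Corpus
@prefix gmeow: <https://blackcatinformatics.ca/gmeow/> .
# Assumption: placeholder namespace for example resources
@prefix ex:    <http://example.org/> .

# Assumption: the member is typed and labelled here only for illustration.
ex:mail-archive a gmeow:InformationObject ;
    rdfs:label "Mail archive"@en .

# The corpus itself, as in the trimmed snippet.
ex:corpus-lillith a gmeow:Corpus ;
    rdfs:label "Lillith working corpus"@en ;
    gmeow:corpusMember ex:mail-archive ;
    gmeow:contentDigest "blake3:0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef" .
```

Since gmeow:Corpus is a subclass of gmeow:InformationObject, a reasoner applying RDFS subclass entailment would also treat ex:corpus-lillith as a gmeow:InformationObject.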
Common Companion Terms
Projects To
| Profile | External Targets |
|---|---|
| dcat | dcat |
External Equivalences
Equivalent or closely aligned targets: schema, wd
Linkages
Generated from the canonical mapping DSL. SSSOM files are the generated public interchange form for term equivalences.
Term Equivalences
| Source | Kind | Profile | Predicate/Relation | Target | Evidence |
|---|---|---|---|---|---|
| gmeow:Corpus | equivalence | - | skos:closeMatch | schema:Dataset | gmeow-graphrag.sssom.tsv; gmeow:eqGr002; confidence 0.7 |
| gmeow:Corpus | equivalence | - | skos:closeMatch | wd:Q461183 | gmeow-graphrag.sssom.tsv; gmeow:eqGr001; confidence 0.8 |
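Read as plain RDF, the two equivalence rows amount to a pair of skos:closeMatch statements. The sketch below is only an illustration of that reading. The generated SSSOM file remains the interchange form, and the confidence values and evidence identifiers have no place in bare triples. The schema: and wd: namespace IRIs are the commonly used ones and are an assumption about what this project binds them to. The gmeow: namespace is inferred from the term IRI.

```turtle
@prefix skos:   <http://www.w3.org/2004/02/skos/core#> .
# Assumption: commonly used namespace IRIs for schema.org and Wikidata entities
@prefix schema: <https://schema.org/> .
@prefix wd:     <http://www.wikidata.org/entity/> .
# Assumption: namespace inferred from the term IRI
@prefix gmeow:  <https://blackcatinformatics.ca/gmeow/> .

# Both targets are close matches, not exact equivalents.
gmeow:Corpus skos:closeMatch schema:Dataset ,
                             wd:Q461183 .
```

Using skos:closeMatch rather than owl:equivalentClass keeps the alignment advisory: no class-level inference follows from it.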
Projection Coverage
| Source | Kind | Profile | Predicate/Relation | Target | Evidence |
|---|---|---|---|---|---|
| gmeow:Corpus | projection | dcat | projects to / <= | dcat:Dataset | gmeow:mapDcatCorpus; confidence 0.85; lossy: the corpus's retrieval role (index membership, embedding lineage) is invisible to DCAT, and only the dataset facet survives |
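To make the lossiness concrete, here is a rough sketch of what a DCAT consumer might see for the Lillith corpus after projection. It is not the output of gmeow:mapDcatCorpus, whose exact rules are not shown on this page. Carrying rdfs:label across is an assumption, and the ex: namespace is a placeholder.

```turtle
@prefix dcat: <http://www.w3.org/ns/dcat#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
# Assumption: placeholder namespace for example resources
@prefix ex:   <http://example.org/> .

# Only the dataset facet is visible on the DCAT side.
# Assumption: the label is carried over unchanged.
ex:corpus-lillith a dcat:Dataset ;
    rdfs:label "Lillith working corpus"@en .

# Statements about the corpus's retrieval role (index membership,
# embedding lineage) have no DCAT counterpart and do not appear here.
```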