Purpose unfolding now.
A blockchain for the world's information.
Direct access to a provenance-stamped knowledge graph for frontier AI companies, research institutions, and infrastructure builders.
Telosima mints entities at scale: domains measured against the full vocabulary of human language, timestamped, provenanced, with graph edges discovered deterministically.
Most web crawl datasets are noisy, unstructured, and impossible to verify. Telosima is different.
Every entity carries full provenance. Discovery timestamp, mint timestamp, source URL, content hash, recrawl history. Every measurement is falsifiable. You know where the data came from and when it was captured.
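To make the provenance fields concrete, here is a minimal sketch of what such a record could look like and how a content hash makes a capture checkable. The class, field names, and the choice of SHA-256 are assumptions for illustration; the actual Telosima schema is not published here.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class ProvenanceRecord:
    """Hypothetical provenance shape; field names are illustrative assumptions."""
    source_url: str
    discovered_at: datetime
    minted_at: datetime
    content_hash: str            # hex digest of the captured bytes
    recrawl_history: tuple = ()  # earlier capture timestamps, oldest first


def hash_content(content: bytes) -> str:
    """Hash captured bytes. SHA-256 is an assumed choice, not a confirmed one."""
    return hashlib.sha256(content).hexdigest()


def verify(record: ProvenanceRecord, content: bytes) -> bool:
    """Return True if the content matches the hash stored at capture time."""
    return hash_content(content) == record.content_hash


# Usage: a record verifies against the bytes it was built from, and nothing else.
captured = b"<html>example page</html>"
now = datetime(2024, 1, 1, tzinfo=timezone.utc)
record = ProvenanceRecord("https://example.org/", now, now, hash_content(captured))
print(verify(record, captured))         # True
print(verify(record, captured + b"!"))  # False
```

Anyone holding the captured bytes can recompute the hash and compare it with the record, so the check does not depend on trusting the publisher.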
Deterministic measurements, labeled outputs. Schema scores are calculated against the full vocabulary. Semantic fingerprints run across 42 language dictionaries. Generated outputs are labeled as generated with inputs shown. No black box.
Graph edges discovered through measurement. Common and uncommon edges form deterministically. No manual tagging. No subjective classification. Relationships emerge from accumulated measurements.
Temporal attestation. Entities recrawl on schedule. Every pass generates a new timestamped record. Previous records are archived in entity folders. See how websites change over time.
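The recrawl behavior described above can be sketched as an append-only history: each pass archives the current snapshot and records a new one. This is an illustrative model only. The names Snapshot, EntityHistory, and recrawl are hypothetical, and an in-memory list stands in for the entity folder.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Snapshot:
    captured_at: datetime
    content_hash: str


def snapshot(content: bytes, captured_at: datetime) -> Snapshot:
    """Build a timestamped snapshot (SHA-256 is an assumed hash choice)."""
    return Snapshot(captured_at, hashlib.sha256(content).hexdigest())


@dataclass
class EntityHistory:
    current: Snapshot
    archive: list = field(default_factory=list)  # stand-in for the entity folder

    def recrawl(self, content: bytes, captured_at: datetime) -> bool:
        """Archive the current snapshot, record a new one, report whether content changed."""
        new = snapshot(content, captured_at)
        changed = new.content_hash != self.current.content_hash
        self.archive.append(self.current)  # every pass is kept, even if unchanged
        self.current = new
        return changed


# Usage: two recrawls, the second one with changed content.
t0 = datetime(2024, 1, 1, tzinfo=timezone.utc)
t1 = datetime(2024, 2, 1, tzinfo=timezone.utc)
t2 = datetime(2024, 3, 1, tzinfo=timezone.utc)
history = EntityHistory(current=snapshot(b"version 1", t0))
print(history.recrawl(b"version 1", t1))  # False
print(history.recrawl(b"version 2", t2))  # True
print(len(history.archive))               # 2
```

Keeping unchanged passes in the archive preserves the claim that every pass yields a timestamped record, so an unchanged site is itself attested.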
Multilingual by design. 42 language dictionaries. Semantic overlap patterns across language families. Linguistic bias measurements. This is the only web dataset measuring knowledge organization beneath the language layer.
API design is in progress. Early access partners will shape the specification.
Planned endpoints:
/entities: Query minted entities by domain, TLD, schema score, language matches
/entities/{id}: Fetch full entity data including manifest, Root-LD, folder contents
/starships: List available starships (schema, languages, TLDs)
/starships/{id}: Fetch starship data including all appended entities
/starjets/{id}: Fetch starjet data (individual vocabulary nodes)
/graph/edges: Query graph edges by type, confidence, entity pairs
/search: Fuzzy search across all minted entities
Response format: JSON-LD with Root-LD wrapper. Machine-readable, traversable, provenanced.
Rate limits: TBD based on tier (research, commercial, enterprise).
Authentication: API key + OAuth for enterprise tiers.
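Because the specification is still in progress, the following is only a sketch of how a client might build a request to the planned /entities endpoint. The base URL, the query parameter names (tld, min_schema_score), and the bearer-token header are all assumptions, not published details. The request is constructed but never sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder host: no real base URL has been published.
BASE_URL = "https://api.example.invalid"


def build_entities_request(api_key: str, **filters) -> Request:
    """Build (but do not send) a GET request for the planned /entities endpoint.

    The filter names and the Authorization scheme are hypothetical.
    """
    query = urlencode(sorted(filters.items()))  # sorted for a stable URL
    url = f"{BASE_URL}/entities" + (f"?{query}" if query else "")
    headers = {
        "Authorization": f"Bearer {api_key}",  # assumed API-key scheme
        "Accept": "application/ld+json",       # standard JSON-LD media type
    }
    return Request(url, headers=headers)


# Usage: filter by TLD and a minimum schema score (both parameter names assumed).
req = build_entities_request("YOUR_API_KEY", tld="org", min_schema_score=0.8)
print(req.full_url)
# https://api.example.invalid/entities?min_schema_score=0.8&tld=org
```

Requesting application/ld+json matches the stated JSON-LD response format. How the Root-LD wrapper is structured is not specified here, so the sketch does not attempt to parse a response.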
Early access is available for frontier AI companies, research institutions, and infrastructure builders.
If you're building something that needs citation-grounded, provenanced knowledge at scale, reach out.