Causeway · data platform

The paved path, across the lake.

Every dataset gets a contract, a steward, and a lit route from raw to gold. Causeway is the paved path for data creators.

Onboard with Autobahn · Launch Causeway · Read the docs
What Causeway is

Three piers hold the road up.

01
Contract
Every dataset declares its schema, its freshness, its retention, and its policies. If it breaks, we tell you before downstream does.
02
Lineage
Upstream to consumption, the route is mapped. Semantic lineage that follows the meaning, not just the bytes.
03
Governance
Classification, consent, and AI-use rules live beside the data. One place to audit, one place to answer for.
Contract
The deck rests on piers, one per promise.
Lineage
Every span traceable, end to end.
Governance
Load-bearing, by design.
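
As a rough illustration of the Contract pier, here is a minimal Python sketch of a dataset declaring its schema, freshness, retention, and policies, and of the checks that could flag a break. This page does not show Causeway's actual contract spec, so every name here (`DatasetContract`, `schema_violations`, `is_fresh`, the example columns) is a hypothetical stand-in.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class DatasetContract:
    """Hypothetical contract shape; not Causeway's real spec."""
    name: str
    steward: str
    schema: dict[str, str]          # column name -> declared type
    max_staleness: timedelta        # freshness promise
    retention: timedelta            # how long rows are kept
    policies: tuple[str, ...] = ()  # e.g. a classification tier


def schema_violations(contract: DatasetContract, observed: dict[str, str]) -> list[str]:
    """List differences between the declared columns and the observed ones."""
    problems = []
    for column, declared in contract.schema.items():
        actual = observed.get(column)
        if actual is None:
            problems.append(f"missing column: {column}")
        elif actual != declared:
            problems.append(f"type drift on {column}: {declared} -> {actual}")
    return problems


def is_fresh(contract: DatasetContract, last_loaded: datetime, now: datetime) -> bool:
    """True when the latest load falls inside the declared staleness window."""
    return now - last_loaded <= contract.max_staleness


# Illustrative dataset; the values are made up for the example.
orders = DatasetContract(
    name="orders",
    steward="data-steward@example.com",
    schema={"order_id": "string", "amount": "decimal", "placed_at": "timestamp"},
    max_staleness=timedelta(hours=24),
    retention=timedelta(days=365),
    policies=("restricted",),
)
```

A non-empty result from `schema_violations`, or a `False` from `is_fresh`, is the kind of signal that would let a steward hear about a break before downstream consumers do.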
Articles

Field notes from the road.

Start the walk

Your first dataset reaches Gold this quarter.

Launch Causeway · Talk to sales
FAQ

Questions from the road.

01
What is a causeway, in this context?
A causeway is a road built across a body of water. In Causeway-the-platform, it is the paved path from raw data in the lake to a governed, contracted dataset that downstream can trust.
02
Do I have to move my data?
No. Causeway sits over your existing lake or warehouse (Snowflake, Databricks, BigQuery, Iceberg) and governs in place. You declare contracts on tables you already have.
03
What happens when a contract breaks?
Downstream consumers are notified, the dataset is marked Broken Contract in red, and stewards are paged. Promotions to Gold are blocked until the contract is healed.
04
How does RAG classification work?
Three tiers: Restricted (mask or redact PII), Internal (no exports), Public (no masking). Every dataset carries one tier. One masking policy per tier. We deliberately refuse a taxonomy of fifty.
05
Can Causeway govern AI use of my data?
Yes. Each dataset declares per-right consent for retrieval-augmented generation (RAG), fine-tuning, and external LLM token export. Violations pause model training in the affected region.
06
Is Causeway open source?
The contract spec and lineage protocol are open. The platform itself is a commercial product; pricing is per million rows governed, with volume tiers.
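
The rules in answers 03 to 05 can be sketched in a few lines of Python: one tier per dataset, one masking policy per tier, per-right AI consent, and a Gold promotion gate that closes while a contract is broken. This is an illustration under stated assumptions, not Causeway's API: the type names, the policy labels, and the default-deny treatment of undeclared rights are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Tier(Enum):
    RESTRICTED = "restricted"
    INTERNAL = "internal"
    PUBLIC = "public"


class AIRight(Enum):
    RETRIEVAL = "retrieval"
    FINE_TUNING = "fine_tuning"
    EXTERNAL_EXPORT = "external_export"


# One policy per tier, mirroring the FAQ; the labels are placeholders.
TIER_POLICY = {
    Tier.RESTRICTED: "mask_or_redact_pii",
    Tier.INTERNAL: "no_exports",
    Tier.PUBLIC: "no_masking",
}


@dataclass
class Dataset:
    name: str
    tier: Tier                   # every dataset carries exactly one tier
    consent: frozenset[AIRight]  # rights the dataset has consented to
    contract_healthy: bool = True


def ai_use_allowed(dataset: Dataset, right: AIRight) -> bool:
    """Assumed default-deny: a right is allowed only if explicitly declared."""
    return right in dataset.consent


def can_promote_to_gold(dataset: Dataset) -> bool:
    """Promotion stays blocked while the contract is broken."""
    return dataset.contract_healthy


# Illustrative dataset: restricted tier, consent for retrieval only.
orders = Dataset("orders", Tier.RESTRICTED, frozenset({AIRight.RETRIEVAL}))
```

With this shape, `TIER_POLICY[orders.tier]` resolves to a single policy, `ai_use_allowed(orders, AIRight.FINE_TUNING)` is `False` because that right was never declared, and setting `orders.contract_healthy = False` closes the promotion gate until the contract is healed.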
Resources

Everything you need to build a path of your own.

About Causeway · for the people who build with our data

"Our data platform used to be a lake you had to swim. Causeway is the road we're paving across it: for our creators, by our creators."

Causeway is how we're evolving our internal data platform from a warehouse you file tickets against into a self-service surface our creators actually want to build on. Analysts, scientists, engineers, product operators: anyone who turns data into decisions is a creator here, and the platform works for them first.

Usability is the point. Every dataset ships with a contract, a steward, and a lit route from raw to gold, so a new creator can promote their first table this week instead of next quarter. No ticket queues. No tribal knowledge. The paved path is the fast path.

AI readiness isn't a roadmap item: it's the deck we're building on. Contracts declare consent for retrieval, fine-tuning, and export. Lineage follows meaning, not bytes. When a creator here ships an agent or a model, the data under it is already governed, already trusted, already theirs to use.

1.2K
Creators onboarded
340
Gold datasets
4 wks
Avg. time to Gold
v2.1
Platform release