Product · Data Management

Catalog, lineage, and lifecycle for every dataset.

A unified metadata layer that automatically discovers, classifies, and tracks every dataset across your environment, so you can see where data came from, who can access it, and how it has changed.

Catalog Sync: Live

Automatic Lineage

Column-level lineage captured by the engine itself — no manual annotation, no broken links, always current with your pipelines.

Smart Classification

ML-driven discovery tags PII, regulated data, and business-critical assets the moment they enter the platform.

Lifecycle Policies

Define retention, archival, and purge rules once — they propagate to every storage tier and replica automatically.

Catalog Refresh: Real-time
Asset Types: 60+
PII Detection: AI-Driven
Lineage Depth: Column-level

Know your data. Trust your data.

When every analyst, engineer, and AI agent works from the same catalog, decisions stop being a guessing game. Data Management is the foundation that makes governance, sharing, and compliance possible at scale.

Catalog Specification

Reference architecture for the metadata layer: schema, APIs, lineage capture, and classification policies.

Request Documentation