Getting Started
What is Stemma?
Stemma is a data catalog built for users of the modern data stack. It uses automation and workflow integration to help data owners get documentation into the catalog, while making it easy for end users to discover and trust data.
Tools for the Data Team to Document Data
Auto-generated metadata handles basic stewardship so no asset starts blank. Data that is automatically generated includes:
- Column descriptions “autodescribed” via upstream column-level lineage
- Common queries
- Frequent users
Slack integration to capture context where and when it is expressed
WYSIWYG editor to easily annotate assets with text, images, and existing docs
Tools for Analysts to Find Data
Track data lineage in the Lineage Graph
- Focus mode
- Column filtering
Explore and contribute to the Glossary
- By source type
- By tags
Search and filter for columns in Advanced Search
Tools To Keep Data Owners in Contact with Users
Assign assets to Slack channel teams
Communicate with all downstream users of an asset prior to changes
How to be Successful with Stemma

Kickstart documentation with automated data stewardship

Manage data changes with advanced lineage tools

Enhance existing user workflows by integrating the tools you already use
Articles
- Integrating Your Applications with Stemma: What We Need from You
- Athena Integration
- Azure Active Directory Integration
- BigQuery Integration
- dbt integration
- Delta Lake Integration
- Google OIDC Integration
- Hive Integration
- JIRA Integration
- Looker Integration
- Mode Integration
- Okta Integration
- Redshift Integration
- Slack Integration
- Snowflake Integration
- Tableau Integration
- Connecting to AWS S3
- Guidelines for a Successful Stemma Rollout
- Migrate from existing Amundsen