Migrate from existing Amundsen
Migrating from Amundsen involves three main steps:
1 – Migrating existing metadata from Amundsen to Stemma
In this step, you take all the existing metadata from Amundsen and load it into Stemma. The metadata imported includes, but is not limited to:
- Table and column descriptions
- Tags
- Ownership information
To import this metadata, upload a dump of your Amundsen neo4j
metadata into Stemma:
- Bring the Amundsen storage offline.
- Use
dump -> load
, available on allneo4j
installations, to dump and upload your data.
After import, your Stemma instance will be up to date with your Amundsen instance. However, Stemma has a significant number of additional features so the layout will appear different. Check out the quick start page here:

But your data won’t be periodically updated until you enable periodic updates; see the next step.
2 – Ingesting metadata continuously
You need to configure Stemma to ingest metadata updates on an ongoing basis. Once this configuration is done, you won’t need Python databuilder
jobs to ingest data.
Information ingested in this step includes:
- Table and column names
- Linked issues (JIRA tickets)
- Frequent users
- Lineage, if it exists
IIn Stemma’s Admin interface, provide credentials to your data sources to allow Stemma access to extract metadata on an ongoing basis:
Stemma will now do the work to “upsert” metadata updates, as they occur, into the data you uploaded from Amundsen in the previous step.
3 – Cut Over
At this point, Amundsen and Stemma are running side-by-side, and Stemma recommends sharing access to Stemma with your power users and getting their feedback to ensure all of their use-cases are supported. Usually one week of overlap to obtain feedback and an additional week to incorporate feedback, if applicable, is sufficient.
When all looks good, simply redirect the URL of your internal Amundsen to the Stemma URL.
If you have any feedback or questions about moving from Amundsen to Stemma integration, please reach out to s[email protected]!