Elementary Data
AGAGAdam Goddenyu

Data Lineage: from overlooked feature to primary SaaS converter.

Rebuilding Data Lineage's user journey for the upcoming SaaS release, under a tight deadline.

Data Lineage: from overlooked feature to primary SaaS converter.
Company
Elementary Data icon

Elementary Data

YC-backed, open-source data observability platform serving 10,000+ data engineers and analysts to monitor, trace, and resolve data pipeline issues.

Elementary Data Cloud dashboard - health score overview
RolePrincipal Product Designer
IndustryData Infrastructure / B2B
TeamLead PM / 1 BE / 2 FE
Customers
Context

A well-defined scope is the only way to ship quality results fast.

All company focus shifted to aligning major features for the upcoming SaaS release. Data Lineage was the first feature I chose to work on, a node canvas for tracing pipeline issues and downstream impact.

Conversion driverFlagship feature

Positioned Data Lineage as the primary OSS to SaaS conversion driver, cited by 81% as the main reason.

69% faster1.45x Reduction

Reduced cloud compute costs through optimizing Data Lineage core user flow.

Rapid adoptionFiverr & Elementor

3 design partners converted to paid versions within weeks of private launch.

Community favorite

Repeatedly voted all-time favorite feature in monthly Slack polls.

The Problem

This is how Data Lineage looked when I joined.

With a two-week timeline to deliver and move to the next feature, I prioritized a focused strategy over a broad one to maintain quality.

The default view was loading the entire DAG, as is.
Data lineage graph before redesign
UX Process

Non-traditional discovery cycle, validated on the go.

Posthog showed strong page-load volume but sessions were short and visibly confused. With no time for a full discovery cycle, I validated on the go through quick prototypes, demos, and direct sessions with data professionals and the open-source community.

Qualitative

The investigation mental model

12 data professionals interviews at Elementor & Fiverr:

Most of the time, as an investigation starting point, they know which model they search for to begin with.
For exploration purposes, they need easy navigation between assets.
They want to group models by type.
Quantitative

Main pain points at scale

Open-source Slack community feedback, ~5K members:

Laggy, long loading times caused by rendering large DAGs.
Hard to navigate or locate nodes quickly.
Data lineage is highly desired but unusable, column-level lineage is a critical missing piece.
Main Learnings

Data Lineage lacked an opinionated user experience.

According to research, success meant a user should be able to start from a known model, trace a path to the root cause, and never feel lost or slowed down by the tool itself.

Investigations are progressive

Usually, analysts trace paths from a known table to identify the root cause.

Diagram showing progressive investigation path from issue to root cause

Too much visibility

Loading the full DAG by default is unhelpful and causes performance issues.

Screenshot showing full DAG rendering performance issue

Remaining within context

Current experience lacked tools to refocus starting points during investigations.

Screenshot showing lack of context-switching during investigation
Solution

Designing for progressive investigation, not full visibility.

01

Model-first journey

Users select a model first, then see a focused DAG by default.

Model-first journey: user selects a model and sees a focused DAG
02

File tree with search & filters

Enables quick model location, matching the database structure.

File tree with search and filters for quick model location
03

Node filters, depth controller & direction stepper

Users can group by model type, choose levels to display, and toggle upstream/downstream.

Node filters, depth controller and direction stepper controls
04

Node-level actions

Added contextual actions so users can trace issues without losing context.

Node-level contextual actions panel
05

Column-level lineage

Added visibility layer to follow column connections, a SaaS feature.

Column-level lineage showing column connections across models
Trade-offs

Beyond the scope

These features were designed and validated extensions of Column-level lineage, a SaaS-only feature. They were deprioritized to focus on the best possible open-source experience first.

Column name search

Was developed later
Column name search trade-off

Test results on columns

Was developed later
Test results on columns trade-off

Visible column types

Wasn't developed yet
Visible column types trade-off

Personal note.

Working on a true open-source changed how I approach to research. With 5K+ users giving unfiltered feedback in hours, I learned that speed and quality are not a tradeoff. They are both a result of knowing exactly what problem you are solving.

Elementary Data app screenshot