DataHub Metrics Dashboard
Purpose
Measure whether DataHub is improving successful value extraction from existing demand over a 90-day optimization phase.
Focus: use before growth.
Core Definition (used everywhere)
Data Extraction
A user action that produces a reusable data artifact.
Count exactly these events:
-
Full dataset download (CSV, XLS, etc.)
-
Sample / filtered dataset download
-
Visualization export (PNG, SVG, PDF)
-
Copy-to-clipboard of tabular data (if available)
Do not count:
-
Page views
-
Scrolls
-
Visualization views without export
-
Hover or passive interactions
This definition replaces “downloads” throughout the dashboard.
North-Star Metric
Data Extractions per 1,000 Dataset Page Views
Rationale: best proxy for successful task completion in data-seeking contexts.
Tier 1: Core Page-Level Metrics (must have)
Per dataset page:
-
Page views
-
Unique visitors
-
Total data extractions
-
Data extractions / 1,000 views
-
Bounce rate
-
Median time on page
Interpretation:
-
High views + low extractions → relevance or usability failure
-
Very low time on page (<10s) → scope mismatch or misleading search intent
Tier 2: Extraction Breakdown (diagnostic)
Tracked separately, not optimized separately (initially):
-
Raw dataset downloads
-
Sample / filtered downloads
-
Visualization exports
Purpose:
-
Understand how users extract value
-
Inform later product and pricing decisions
Tier 3: Engagement & Depth (supporting signals)
Per session:
-
Preview interactions (table scrolls, schema views, chart interactions)
-
Related-dataset clicks
-
Multi-dataset sessions (% sessions with ≥2 dataset pages)
These predict future monetization but are not the primary KPI.
Tier 4: Commitment Metrics (observed, not driven)
Tracked but explicitly secondary:
-
Account creations
-
Dataset subscriptions / follows
-
Purchases
These should lag extraction improvements.
Required Segment Filters
Dashboard must support filtering by:
-
Top 20 dataset pages by traffic
-
Free vs paid datasets
-
New vs returning visitors
Optional (later):
-
Device type
-
Traffic source (search / referral)
Reporting Cadence
-
Weekly review: page-level extraction performance
-
Monthly checkpoint: are extraction rates improving on top pages?
Success in this phase = higher extraction density, not more traffic.
One-Line Strategy Reminder (for the dashboard header)
“If users cannot extract value, nothing else matters.”
Next, we can revise the dataset-page audit checklist to align exactly with this extraction-based metric (e.g. every checklist item maps to improving extraction probability).