Product boundary: one-page overview (revised)

What this product is

The product is a hosted publishing platform for data, analogous to Substack for writing or a Statista for indie data publishers: opinionated, constrained, and end-to-end. Users do not “build” data portals; they publish data posts (datasets, data-driven stories or data links) inside a data publication. The value is simplicity, distribution, and monetisation, not flexibility. Data posts are first-class, social, subscribable objects, with built-in distribution and monetisation.

The core object is a data publication, which contains a stream of data posts. A data post can take one of a few closely related forms:

  • a dataset-first post (dataset with minimal narrative), or
  • a data story (narrative text supported by embedded datasets, charts, or previews).
  • a data link (a single URL with a title and description)

Both forms share the same constrained structure and publishing workflow. (in fact, from a user perspective there is just a data post and you can just add data files to that if you want a dataset, or a URL if you want a data link).

Across all flows, the recurring job is the same: reduce the cognitive and operational cost of publishing data to near zero, while preserving distribution, social proof, and monetisation.

What this product is not

  • Not a general data portal (CKAN, Socrata)
  • Not a notebook or analysis environment (Jupyter, Kaggle)
  • Not a BI or dashboarding tool
  • Not a low-level storage service

Any functionality that increases configuration, schema design, or presentation choice is out of scope for MVP.

Core user promise

“I can publish a dataset or a data-driven story with almost no setup, and immediately get a clean URL, previews, subscribers, and (optionally) revenue.”

Atomic units

  • Publication: a named, subscribable container with its own URL
  • Data post: a constrained post that combines data and narrative, ranging from dataset-only to story-led

Primary value drivers

  • Extreme ease of publishing
  • Constrained, legible presentation of data and stories
  • Distribution (feeds + email)
  • Social proof (likes, follows)
  • Monetisation (subscriptions)

If you’re happy with this framing, the next step would be to slightly refine the Data posts and ingestion component to explicitly support “dataset-first” and “story-first” variants under a single post model, and then pick that component (or Publications) to spec in executable detail.