Skip to content

design: experimentation UI screens (Phase 1 + Phase 2) #7181

@talissoncosta

Description

@talissoncosta

Context

Design the UI screens for the warehouse-native experimentation feature across both phases. Designs are being built in Pencil. This is internal-first (dogfood on Flagsmith SaaS, behind feature flag).

PRDs

Phase 1 Screens

Warehouse Connection Settings (P0)

  • Location: Organisation Settings → Data Warehouse (new section)
  • States: empty → configuration form → testing → connected → error
  • Warehouse type selector (Snowflake active, others coming soon)
  • Credential fields, test connection, status indicator

Streaming Status (P1)

  • Events streamed (24h/7d), current status, last event timestamp, error log
  • Mostly for engineering/debugging during dogfood

Environment Streaming Filter (P2)

  • Per-environment toggles: stream evaluations on/off, stream custom events on/off

Phase 2 Screens

Experiment Creation Flow (P0)

  • 4-step wizard: Basics (name, hypothesis, flag) → Variants (control selection) → Metrics (attach primary metric) → Review & Create
  • Entry from flag detail page or Experiments nav

Metric Definition UI (P0)

  • SQL editor with syntax highlighting
  • Validate SQL button (dry run against warehouse)
  • Column mapping (auto-detected, manually adjustable)
  • Type selector: Conversion / Numeric / Count

Experiment Detail Page (P0)

  • Header with status badge and lifecycle actions (Start, Pause, Stop, Record Decision)
  • Sections: Overview, Metrics, SRM Check, Results (Phase 3 placeholder), Decision

Experiment List (P1)

  • Filter by status, sort by date/name
  • Experiment catalogue showing name, status, flag, primary metric, duration, decision

Flag Detail — Experiment Link (P1)

  • Show linked experiment card on flag detail page
  • CTA to create experiment if none exists (only for multivariate flags)

Key Design Principles

  • The flag is always the source of truth for variants and traffic split
  • Experiment is a layer of intent and measurement on top of the flag
  • Hypothesis field should feel intentional, not like a chore
  • SQL editor is the core of metric definition — needs to be good
  • Status transitions should feel deliberate (confirmations on Start, Decision)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions