Synthetic Roadmap

Synthetic Market-Data Roadmap

Implementation-sized phases for extracting deterministic synthetic generation into a first-class reusable engine while keeping the useful NATS, ClickHouse, compute, API, replay, and web stack.

Beads islandflow-259 Recommended: Option B Infra-free early gates

Jump to

Core Constraints

  • Emit canonical market event types: OptionPrint, OptionNBBO, EquityPrint, and EquityQuote.
  • Do not create synthetic-only market event types for the main pipeline.
  • Keep hidden ground-truth labels separate from emitted market events.
  • Keep early quality gates infra-free: bun test should not require Docker, ClickHouse, NATS, or Redis.
  • Build deterministic foundations before demos, UI controls, or live synthetic service behavior.
  • Treat historical calibration as future work, not as a dependency for the MVP synthetic generator.

Phase Sequence

Phase Beads issue Depends on Purpose
01 - Deterministic spineislandflow-259.1NoneCreate the seeded generation foundation and canonical event output contract.
02 - Manifests, fixtures, CLIislandflow-259.2islandflow-zxh.1Turn deterministic generation into durable fixtures and manifests.
03 - Scenarios, labels, expected outputsislandflow-259.3islandflow-zxh.2Author named scenarios, separate labels, and expected derived outputs.
04 - Replay integrationislandflow-259.4islandflow-zxh.3Make replay consume synthetic runs with stable ordering and output comparison.
05 - Demo and load profilesislandflow-259.5islandflow-zxh.4Expose named deterministic demo/load profiles after replay validation.
99 - Future historical calibrationislandflow-259.6islandflow-259.5Calibrate parameters from historical data later, after the MVP is stable.

PR Split Notes

Phase 03a

islandflow-259.3.1 - Scenario catalog and labels

Keep scenario authoring and ground-truth label shape focused before expected-output comparison grows around it.

Phase 03b

islandflow-259.3.2 - Expected-output manifests

Store expected derived outputs as reviewable artifacts for downstream smart-flow validation.

If any other phase starts touching unrelated service, API, UI, and storage behavior in one PR, split it before implementation continues.

Matching Beads Epic

islandflow-259 - Plan synthetic market-data implementation phases.