Fleet 1.17.0 is out.See what's new →
FleetFleet
Use case

AI Workflows for Recurring Data Reports

Every metrics review needs a narrative — the 'what changed and why it matters' paragraph above the chart — and that narrative is rewritten by hand every week by someone whose actual job is elsewhere. Skip the narrative and the numbers get misread; write it hastily and a wrong causal claim ('signups dropped because of the pricing change') propagates into decisions.

Numbers without governance are how dashboards lie to executives.

How it works with an agent fleet

A scheduled Fleet workflow drafts the narrative from exported metrics files, a review step checks every numeric claim against the data and flags causal overreach, and the owner approves before the report circulates.

genflows:
  - name: weekly-metrics
    schedule: "0 8 * * 1"   # Monday mornings
    steps:
      - {name: narrative, prompt: "Write the weekly metrics narrative from the exports: movements, trends, anomalies. Cite the file for every number. NO causal claims without corpus support.", corpus: ["data/exports/*.md", "data/notes/*.md"], kind: data, out: report.md}
      - {name: numbers, prompt: "Verify every number against the exports; flag arithmetic drift and causal claims the data can't support.", depends_on: [narrative], kind: review, out: flags.md}
      - {name: owner-ok, depends_on: [narrative, numbers], kind: approval, out: decision.md}
      - {name: send, depends_on: [owner-ok, narrative], kind: notify, out: sent.md}

The review step is the institutional skeptic: a claimed number that doesn't match the export, or a 'because' the data doesn't support, parks the run at the gate with the flag attached.

The fleet pattern

Metrics exports → narrative draft with citations → number-verification review → owner approval → notify. The chart gets its paragraph, the paragraph gets its fact-check, every week, structurally.

Guardrails that matter here

  • Every number is cited to its export file and re-verified by the review pass — the misquoted-metric failure mode is caught before circulation
  • Causal overreach ('X because Y') is an explicit review target, the most damaging class of narrative error
  • The owner's recorded approval covers the exact narrative that circulated

Who this is for

Ops, growth, and BI owners who circulate a recurring metrics summary assembled from exports — anywhere the narrative layer is hand-written and error-prone.

Frequently asked questions

Does Fleet connect to our data warehouse?

No — the corpus is files. Teams land a weekly export (CSV rendered to markdown, or a metrics summary file) in the repo via their existing pipeline; the workflow takes over from the files. Your data stays in your infrastructure, and the runs execute on your own worker.

What does the 'data' step kind do differently?

It's a document-producing step like report, signaling data-shaped output. The governance is identical: corpus in, artifact out, review and approval downstream.

Run your first agent fleet

One binary. Five minutes. See every agent, coordinate every handoff, and keep a full audit trail of what your fleet did.