
Building a Chat App

This guide walks you through building a complete chat application from blocks to React UI to tests. Along the way, you'll understand what the framework does behind the scenes and how the concepts connect.

What we're building: A chat app where an LLM generates responses, a handler tracks message count in session state, and a React UI shows the conversation plus the count. Items stream in real time. Tests run without real LLM calls.

Concepts we'll cover: Generator slots (model, prompt, history, user), partial state schemas, sequencer composition, flow definition (kind, actions, userMessage, clientData), server registration, React hooks, and deterministic testing.


1. Define the blocks

The generator — talks to the LLM

Generators are where AI happens. The framework manages the tool loop, streaming, and prompt assembly. You configure how the model receives context through four slots.

src/flows/hello-chat/blocks/chat-gen.ts
import { generator } from "@flow-state-dev/core";
import { z } from "zod";

export const chatGen = generator({
  name: "chat",
  model: "gpt-5-mini",
  prompt: "You are a helpful assistant. Be concise and friendly.",
  inputSchema: z.object({ message: z.string() }),
  history: (_input, ctx) => ctx.session.items.llm(),
  user: (input) => input.message,
});

What each slot does:

  • model — The model ID. The framework resolves this at runtime via a model resolver (OpenAI, Anthropic, etc.). You can override it per request for testing or A/B runs.
  • prompt — The system instruction. Sent first in every model call. Can be a string or a function (input, ctx) => string for dynamic prompts.
  • history — Prior conversation messages. This is how the model remembers what was said before. The framework calls ctx.session.items.llm() to get completed messages in { role, content } format, filters and formats them, and injects them into the prompt. Without this, each request would be stateless.
  • user — The current user input. Extracted from the action input. In our case, it's input.message. The framework adds this as the final user-role message before calling the model.

The framework assembles the full prompt in order: prompt, context (if any), history, user. It handles tool calls, retries, and streaming. You focus on what goes in, not how it gets there.

Optional slots: You can add a context slot for extra data (e.g. retrieved documents). You can add tools to enable function calling. The generator will loop: call model, execute tool, call model again, until the model stops or hits a bound. All of that is framework-managed.
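That loop can be pictured roughly like this. This is a conceptual sketch in plain TypeScript, not the framework's implementation; `ModelReply`, `callModel`, `tools`, and `maxToolCalls` are made-up names for illustration:

```typescript
// Conceptual sketch of a generator tool loop. All names here are
// illustrative stand-ins, not framework APIs.
type ModelReply =
  | { kind: "text"; text: string }
  | { kind: "tool_call"; name: string; args: unknown };

function runToolLoop(
  callModel: (transcript: string[]) => ModelReply,
  tools: Record<string, (args: unknown) => string>,
  maxToolCalls = 5,
): string {
  const transcript: string[] = [];
  for (let i = 0; i <= maxToolCalls; i++) {
    const reply = callModel(transcript);
    if (reply.kind === "text") return reply.text; // model stopped: done
    // Model asked for a tool: execute it, feed the result back, loop again.
    const result = tools[reply.name](reply.args);
    transcript.push(`tool:${reply.name} -> ${result}`);
  }
  throw new Error("tool-call bound exceeded");
}
```

The point is only the shape: model call, tool execution, model call again, until the model produces text or the bound trips.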

The handler — tracks usage

Handlers are pure logic. No LLM. They validate, transform, and mutate state.

src/flows/hello-chat/blocks/counter.ts
import { handler } from "@flow-state-dev/core";
import { z } from "zod";

export const counter = handler({
  name: "counter",
  inputSchema: z.any(),
  outputSchema: z.any(),
  sessionStateSchema: z.object({ messageCount: z.number().default(0) }),
  execute: async (input, ctx) => {
    await ctx.session.incState({ messageCount: 1 });
    return input;
  },
});

Partial state schema: The handler declares only messageCount. It doesn't need to know about other session state fields. This partial schema bubbles up to the flow level, where the framework merges it with any other blocks' declarations. The benefit: blocks stay portable and self-documenting. A counter block can be reused in flows that have different full schemas.

incState is atomic: Unlike patchState (which does a read-modify-write), incState is a single CAS-guarded operation. Safe under concurrent requests. If two messages arrive at once, the count increments correctly. Use incState for counters and other commutative updates. Use patchState when you need to set specific values.
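The lost-update hazard is easy to see with explicit steps. A plain-TypeScript sketch, with made-up stand-ins for the store rather than the framework's actual `patchState`/`incState`:

```typescript
// Sketch of why read-modify-write loses updates while an atomic
// increment does not. Plain stand-ins, not the framework's store.
let count = 0;

// patchState-style: read a snapshot now, write it back later.
const readForPatch = (): number => count;
const writePatch = (snapshot: number): void => { count = snapshot + 1; };

// incState-style: one atomic step against the current value.
const atomicInc = (): void => { count += 1; };

// Two interleaved patch-style updates lose an increment:
const a = readForPatch(); // request A reads 0
const b = readForPatch(); // request B reads 0 before A writes back
writePatch(a);            // A writes 1
writePatch(b);            // B also writes 1 -- A's update is lost
// count is now 1, not 2
```

With two atomic increments instead, the count would end at 2 regardless of interleaving, which is why commutative updates belong in incState.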


2. Compose the pipeline and flow

The sequencer — composition primitive

A sequencer chains blocks into a pipeline. Each step's output becomes the next step's input.

const pipeline = sequencer({ name: "chat-pipeline", inputSchema })
  .then(chatGen)
  .then(counter);

Why this order? The generator produces the assistant response. The handler runs afterwards: it passes that output through unchanged and uses the session context to increment the count. If we put the counter first, we'd count before the LLM replied. Order encodes data flow.

The sequencer is the composition primitive. It replaces the agent-vs-workflow split. You chain blocks. Conditional logic, parallelism, and error recovery come from sequencer methods like thenIf, parallel, and rescue. For a simple chat pipeline, .then() is all you need. As flows grow, you'll use more of the DSL.
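The core idea is just function composition. A minimal sketch in plain TypeScript, where `chainSteps`, `fakeGen`, and `fakeCounter` are illustrative stand-ins, not framework APIs:

```typescript
// Minimal sketch of the sequencer idea: each step's output becomes
// the next step's input. Stand-ins only, not the framework's API.
type Step<I, O> = (input: I) => O;

function chainSteps<A, B, C>(first: Step<A, B>, next: Step<B, C>): Step<A, C> {
  return (input) => next(first(input));
}

let calls = 0;
const fakeGen: Step<{ message: string }, string> = (i) => `echo: ${i.message}`;
const fakeCounter: Step<string, string> = (out) => { calls += 1; return out; };

// Mirrors pipeline = sequencer(...).then(chatGen).then(counter)
const pipeline = chainSteps(fakeGen, fakeCounter);
```

The real sequencer adds schema validation, streaming, and the richer DSL on top, but the data flow is this same left-to-right chain.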

The flow definition

src/flows/hello-chat/flow.ts
import { defineFlow, sequencer } from "@flow-state-dev/core";
import { z } from "zod";
import { chatGen } from "./blocks/chat-gen";
import { counter } from "./blocks/counter";

const inputSchema = z.object({ message: z.string() });

const pipeline = sequencer({ name: "chat-pipeline", inputSchema })
  .then(chatGen)
  .then(counter);

const chatFlow = defineFlow({
  kind: "hello-chat",
  requireUser: true,

  actions: {
    chat: {
      inputSchema,
      block: pipeline,
      userMessage: (input) => input.message,
    },
  },

  session: {
    stateSchema: z.object({
      messageCount: z.number().default(0),
    }),

    clientData: {
      messageCount: (ctx) => ctx.state.messageCount ?? 0,
    },
  },
});

export default chatFlow({ id: "default" });

What each part means:

  • kind — The flow identifier. Becomes the URL path: /api/flows/hello-chat/actions/chat. Clients use it to target this flow.
  • actions — The flow's public API. Each action is an entry point. Clients invoke them by name: sendAction("chat", { message: "Hello" }). The framework validates input against inputSchema, resolves the session, and executes the block.
  • userMessage: (input) => input.message — Before block execution, the framework emits a user-role message item with this text. That way the conversation stream shows what the user said. Without it, you'd only see assistant messages.
  • clientData — The sole gateway for exposing server state to clients. Raw state never leaves the server. Every clientData entry is client-visible. Here we expose messageCount so the UI can display it. The clientData functions run when the framework builds a state snapshot (e.g. after request.completed). The client fetches snapshots via GET /api/flows/sessions/:sessionId/state.
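The snapshot-building step can be pictured as running each clientData projection over the server-side state. A simplified sketch, not the real runtime; the `Ctx` shape and `secretNotes` field are hypothetical:

```typescript
// Sketch: raw state stays server-side; only declared projections
// reach the client. `Ctx` and `secretNotes` are made up for illustration.
type Ctx = { state: { messageCount?: number; secretNotes?: string } };

const clientData = {
  messageCount: (ctx: Ctx) => ctx.state.messageCount ?? 0,
};

function buildSnapshot(ctx: Ctx): Record<string, unknown> {
  const snapshot: Record<string, unknown> = {};
  for (const [key, project] of Object.entries(clientData)) {
    snapshot[key] = project(ctx);
  }
  return snapshot; // secretNotes is never included
}
```

Anything not listed in clientData (here, `secretNotes`) simply never appears in the snapshot, which is what "the sole gateway" means in practice.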

3. Set up the server

Register the flow. Get a complete REST API.

app/api/flows/[...path]/route.ts
import { createFlowRegistry, createFlowApiRouter } from "@flow-state-dev/server";
import chatFlow from "@/flows/hello-chat/flow";

const registry = createFlowRegistry();
registry.register(chatFlow);

const router = createFlowApiRouter({ registry });

export const GET = router.GET;
export const POST = router.POST;
export const DELETE = router.DELETE;

What you get: Action execution, session management, SSE streaming with resume, and state snapshots. No route wiring.

Key endpoints:

  • POST /api/flows/hello-chat/actions/chat — Execute the chat action (new or existing session)
  • GET /api/flows/hello-chat/requests/:requestId/stream — SSE stream for that request
  • GET /api/flows/sessions/:sessionId/state — State snapshot (clientData)
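If you call these endpoints by hand (outside the React hooks), small URL helpers keep the paths consistent. These helpers are hypothetical, not exported by the framework; they just mirror the routes listed above:

```typescript
// Hypothetical client-side helpers that build the endpoint URLs above.
const base = "/api/flows";

const actionUrl = (kind: string, action: string) =>
  `${base}/${kind}/actions/${action}`;

const streamUrl = (kind: string, requestId: string) =>
  `${base}/${kind}/requests/${requestId}/stream`;

const stateUrl = (sessionId: string) =>
  `${base}/sessions/${sessionId}/state`;
```

A raw client would POST to `actionUrl("hello-chat", "chat")`, open an EventSource on the stream URL from the response, and poll or refetch the state URL for snapshots.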

The catch-all route [...path] lets the framework handle routing internally. One file, full API. You can add a custom model resolver, store adapters, or middleware by passing options to createFlowApiRouter. See Server Setup for details.


4. Build the React UI

The React hooks handle streaming, reconnection, and state sync. You render.

src/components/ChatApp.tsx
import {
  FlowProvider,
  ItemRenderer,
  useFlow,
  useSession,
  useClientData,
} from "@flow-state-dev/react";

function ChatApp() {
  return (
    <FlowProvider flowKind="hello-chat" userId="devuser">
      <ChatUI />
    </FlowProvider>
  );
}

function ChatUI() {
  const flow = useFlow({ autoCreateSession: true });
  const session = useSession(flow.activeSessionId, {
    items: { visibility: "ui" },
  });

  const clientData = useClientData(session, {
    session: ["messageCount"],
  });

  const handleSubmit = (e: React.FormEvent<HTMLFormElement>) => {
    e.preventDefault();
    const form = e.currentTarget;
    const message = new FormData(form).get("message") as string;
    if (message.trim()) {
      session.sendAction("chat", { message });
      form.reset();
    }
  };

  return (
    <div>
      <header>
        <h1>Chat</h1>
        <span>{clientData.session?.messageCount ?? 0} messages</span>
      </header>

      <div>
        {session.items.map((item) => (
          <ItemRenderer key={item.id} item={item} />
        ))}
      </div>

      <form onSubmit={handleSubmit}>
        <input
          name="message"
          placeholder="Type a message..."
          autoComplete="off"
        />
        <button type="submit" disabled={session.isStreaming}>
          {session.isStreaming ? "Thinking..." : "Send"}
        </button>
      </form>
    </div>
  );
}

What each hook does:

  • FlowProvider sets up context. Every hook below it receives flowKind and userId. Child components use this to target the right flow and user. Wrap your app or a section of it.
  • useFlow with autoCreateSession: true creates a session on mount if none exists. It tracks activeSessionId so you know which session to subscribe to.
  • useSession connects to the SSE stream for that session. It delivers items in real time, provides sendAction and isStreaming, and re-renders when items or status change. The visibility: "ui" option filters to items the client should display (messages, components, etc.).
  • useClientData reads from the latest state snapshot. You list which clientData keys to subscribe to. The hook refetches when the snapshot changes (e.g. after request.completed).

5. Write tests

No real LLM calls. No network. Deterministic results.

src/flows/hello-chat/__tests__/flow.test.ts
import { testFlow } from "@flow-state-dev/testing";
import chatFlow from "../flow";

test("chat action streams a response and increments count", async () => {
  const result = await testFlow({
    flow: chatFlow,
    action: "chat",
    input: { message: "Hello!" },
    userId: "testuser",
    generators: {
      chat: { output: "Hi there!" },
    },
  });

  // User message was emitted
  expect(result.items).toContainEqual(
    expect.objectContaining({ type: "message", role: "user" })
  );

  // Assistant message was emitted
  expect(result.items).toContainEqual(
    expect.objectContaining({ type: "message", role: "assistant" })
  );

  // State was updated
  expect(result.session.state.messageCount).toBe(1);
});

test("message count accumulates across requests", async () => {
  const result = await testFlow({
    flow: chatFlow,
    action: "chat",
    input: { message: "Second message" },
    userId: "testuser",
    seed: {
      session: { state: { messageCount: 3 } },
    },
    generators: {
      chat: { output: "Response" },
    },
  });

  expect(result.session.state.messageCount).toBe(4);
});

Testing philosophy: The test harness creates an isolated runtime with in-memory stores. It mocks generators by name: generators.chat.output replaces the real LLM call with that string. Same contracts as production: validation, session resolution, block execution, state persistence, lifecycle hooks. No flakiness from API latency or rate limits.

Seeding state: Use seed.session, seed.user, or seed.project to simulate scenarios. Here we start with messageCount: 3 and verify the handler increments to 4. Useful for multi-turn flows, permission checks, and state-dependent behavior.

Generator mocking: The key in generators matches the block name. Our generator is named "chat", so generators.chat.output replaces its output. For generators with tools, you can mock tool results too. See the Testing docs for the full API.
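Mocking-by-name can be pictured as a lookup layered in front of the real model call. A conceptual sketch only; `runGenerator` and `GeneratorMocks` are made-up names, not the test harness's internals:

```typescript
// Sketch: if a mock exists under the generator's name, return its
// canned output instead of calling the model. Illustrative only.
type GeneratorMocks = Record<string, { output: string }>;

function runGenerator(
  name: string,
  realCall: () => string,
  mocks: GeneratorMocks = {},
): string {
  const mock = mocks[name];
  return mock ? mock.output : realCall();
}
```

This is why the key must match the block's `name` field: the harness has nothing else to match the mock against.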


Next steps