Streaming

API

Definition

An API mode where Claude sends partial response tokens as they are generated, rather than waiting for the full response. Reduces perceived latency for users. Use server-sent events (SSE) to consume the stream. Not available with the Message Batches API.

Example Usage

Enable streaming for interactive chat interfaces to show partial responses and reduce time-to-first-token.

Related Terms