Streaming
APIDefinition
An API mode where Claude sends partial response tokens as they are generated, rather than waiting for the full response. Reduces perceived latency for users. Use server-sent events (SSE) to consume the stream. Not available with the Message Batches API.
Example Usage
Enable streaming for interactive chat interfaces to show partial responses and reduce time-to-first-token.