Skip to content

feat: liveness signal for wedged provider streams (5min chunk timeout hides stalls) #957

@anandgupta42

Description

@anandgupta42

Found during the v0.8.8 release review (Chaos Gremlin / on-call SRE, P2 deferred).

#844 raised DEFAULT_CHUNK_TIMEOUT from 2min to 5min. The watchdog (wrapSSE in provider.ts) only fires when a chunk fails to arrive within the window, so a genuinely wedged provider stream (connection alive, no data) now hangs a user for ~5 minutes before the timeout aborts — up from ~2. Errors are surfaced, not silent, but the user-visible stall doubled.

This was an intentional tradeoff (slow warehouse/LLM streams were tripping the 2min watchdog), so it is deferred. Possible follow-up: a separate, shorter "no first byte" / liveness signal distinct from the steady-state chunk timeout, or surfacing a "still waiting…" indicator past N seconds so a wedged stream is visible before the hard timeout.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions