Pipeline · stage 11

track()

Replies, opens (where consented), bounces, and downstream meeting bookings. Wired back into the eval set.

Inputs and outputs

In. Inbound mail through your IMAP. Calendar events through your calendar integration. GA4 events from per-lead PDF views.

Out. Per-lead funnel: sent → opened → replied → meeting booked. Per-angle reply-rate updates. Eval-set entries for any negative outcome.

Current version

v2.0.0 Forkable through the prompt library.

Model defaults

Reply classification is a small hosted classifier; we use it to label replies as positive, negative, neutral, out-of-office, or wrong-person. Anything ambiguous goes to the human. You can override per stage in your routing config.

How it fails

Threading. Reply detection across forwarded chains and aliases is hard. We get the easy 92% right; the rest get flagged for manual labelling.

Evals

Reply classifier agreement with human labellers on 3,000 replies: 94.1%. The full eval set ships with the prompt and is editable. Eval format.