CEF · signal reference · internal accuracy sheet

What the agent flags — recruiting vs coach

One engine, two verticals. This is the internal accuracy sheet behind the demo: every signal the agent actually computes, in plain language up top and in code underneath, with sources. It is not coach-facing material — the video is.

🎯

"Would a coach understand this?" The raw taxonomy below (0–1 scores, lens internals, file paths) is for the team, to confirm the demo is truthful. A coach sees the video, not this. But note: a relationship coach already speaks most of the coach column — Gottman's four horsemen, avoidant attachment, EFT pursue–withdraw are their own frameworks. That's the part to lead with. Fred's test: "why should people care?"

✓ Grounded 1:1 in CEF-AI/agent-catalog (hiring-intelligence, dating-coach, nlp-kit) and Bren's Hiring Coach (Lab) recording (2026-06-25). Delivery signals are 0–1 and advisory; scores are 0–10. Sources listed per section and in full at the end.

In plain language

The same signals, said the way the person you're selling to would say them, and why they'd care.

Recruiting · "Hiring Coach"

What it catches in an interview

  • Is this person reading a script or speaking from real experience?
  • Are they clear and specific, or hand-wavy and generic?
  • What did they dodge or talk around?
  • Were they actually engaged and curious, or flat?
  • How good was the interviewer — did they probe or let it slide?
  • Then it reconciles the résumé with how the call actually went.
Why a recruiter cares: catch the polished-but-empty candidate, coach your interviewers, decide faster and more fairly. Advisory, never the final verdict.
Coach · "Dale Coach"

What it catches in a conversation

  • Was your client (or their date) being real, or performing?
  • Where did they get defensive, avoidant, or shut down (the four horsemen)?
  • What they said vs what they meant — the moment between the lines.
  • How interested is the other person, really?
  • Should they follow up, and when — while it's still warm.
Why a coach cares: see the moment that mattered after the date, hand your client the exact next move, and serve 10x more clients. This column is already your vocabulary (Gottman, attachment, EFT).

At a glance

Two agents in the marketplace, one shared engine underneath.

CEF marketplace showing Hiring Coach and Dale Coach agents
Source: Bren's recording — the CEF agent marketplace. Featured "Hiring Coach (Lab)" is described verbatim as "Analyses interview recordings for authenticity (reading/recited vs spontaneous). Advisory delivery signal — not a hiring verdict or proof of misconduct." Both "Hiring Coach" and "Dale Coach" run the same delivery engine.

The shared engine — what relates to both

The genuinely cross-vertical layer. Same diarization + delivery/authenticity signals for an interview or a date. All scored 0–1 and explicitly advisory.

Authenticity
Reading / recited vs spontaneous. Splits into a linguistic and a prosodic read, fused to a reading likelihood. Per-turn alerts.
Clarity
structure · expression · overall. "Clarity of thought" — are they well-structured in thinking and speaking (Fred's framing on the call).
Engagement
overall · question_quality · curiosity. Were they actually present and asking, or coasting.
Vocal emotion
arousal · dominance · valence (SER model). Affect under the words.

Source: CEF-AI/agent-catalog · packages/nlp-kit/src/diarization/extract-signal.ts:1-11 ("Lifted from dating-coach so every agent shares one diarization path… hiring → Candidate/Interviewer, dating-coach → user/partner"); delivery buckets agents/hiring-intelligence/src/delivery/metrics.ts:11-28; integrity/reading src/integrity/report.ts:11,51-62. Live values from Bren's Lab below.

Hiring Coach Lab summary metrics
The Lab summary, verbatim: Authenticity .80 reading Clarity .63 Engagement .17 Vocal affect .43. COLOR-BY: Reading likelihood · Clarity (overall) · Fluency · Pace regularity · Vocal arousal · Engagement (overall). This is exactly what the corrected demo S3 now renders.
Hiring Coach Lab expanded signal groups
Expanded, per judge: Clarity = structure .55 / expression .70 / overall .63 (gemma judge); Vocal emotion = arousal .43 / dominance .50 / valence .63 (ser-odyssey-wavlm); Engagement = overall .17 / question_quality .10 / curiosity .10. Costs metered in "cr" credits per judge.

Recruiting signals — the interview lenses

On top of the shared engine, the hiring agent runs named "lenses" over the transcript. Each emits quote-grounded traits with a 0–1 confidence.

evasion
What the candidate avoided, dodged, deflected, or pivoted away from.
confidence
Hedging ("I think", "sort of"), filler density, assertion strength, certainty.
bs
Hand-wavy, generic, buzzword-heavy, evidence-free answers.
emphasis
What they emphasised — repetition, emotional peaks, volunteered stories.
subtext
Implicit signals — what they communicated through framing, not said outright.
interviewer-*
Four lenses that audit the interviewer: depth, follow-up, balance, bias.

Source: CEF-AI/agent-catalog · agents/hiring-intelligence/src/lens/prompts.ts:4-35 (DEFAULT_LENSES); trait shape src/lens/schema.ts:6-13; semantic verdict src/lens/semantic.ts:9-18 (verdict ∈ strong·partial·missed, authenticity ∈ backed·unbacked·evasive·low_fluency, insight ∈ lived·recited·unclear, action ∈ proceed·reaffirm·walk_away). Per-fleet customizable: src/lens/store.ts:10 (lenses + prompt_overrides).

Coach signals — your own frameworks

The dating-coach reads "between the lines" (free-form) and tags moments against a science registry — the frameworks a relationship coach already uses.

FrameworkTags it can apply
Gottman (four horsemen)criticism · contempt · defensiveness · stonewalling
Attachment theorysecure · anxious · avoidant · disorganized
EFTemotional_cycle · pursuer_withdrawer · reach_for_connection
NVCobservation · feeling · need · request
Vulnerabilityvulnerability · authenticity · shame_response
Active listeningparaphrase · open_questions · reflective_summary
Cognitive distortionsmind_reading · catastrophizing · all_or_nothing · personalization

Source: CEF-AI/agent-catalog · science registry packages/nlp-kit/src/science/registry.ts:10-60 (each framework has valid labels + a confidence floor); between-the-lines agents/dating-coach/src/analyze/schemas.ts:10-15 ({moment, they_said, they_meant, signal}); tag validation src/analyze/group-schemas.ts:13-29; verdict schemas.ts:4-8 (interest_score 0–10, should_follow_up ∈ yes·maybe·no); when-to-act schemas.ts:35-39 (send_within_hours); key moments ∈ missed_hook·good_catch·line_that_mattered schemas.ts:17-21.

Side by side

The full comparison. Shared rows are the engine; the rest is per-vertical.

DimensionRecruiting · Hiring CoachCoach · Dale Coach
Subjectcandidate interview recordinga date / client conversation
Delivery signalsSHARED · Authenticity (reading), Clarity, Engagement, Vocal emotion — all 0–1, advisory
Content signalslenses: evasion, confidence, bs, emphasis, subtext (+ 4 interviewer)between-the-lines + framework tags (Gottman, attachment, EFT, NVC…)
Verdictstrong / partial / missed; action: proceed · reaffirm · walk_awayshould_follow_up: yes / maybe / no
Scorerésumé 0–10 → composite 0–10 (consolidation)interest_score 0–10
When to actrecommended_actionsend_within_hours
Stage dimensionNONE in either. (No "interview round" / "relationship stage" matrix — the demo's old grid was invented.)
Customizationper-fleet: pick lenses + rewrite any promptscience-registry framework tags (global)
Graph nodesCandidate, Role, Trait, Claim, BehaviorPattern, Outcome, Interview…Date, Person, Topic, Claim, BehaviorPattern, CoachingFocus, User
Learning looplens weights ±0.05 (cap 6.0), outcome correlation/lift, human calibrationcoaching-focus reinforcement; prior dates embedded as context
Shared layernlp-kit diarization + delivery engine (both); science registry (coach-only)

The score model

There is no 0–100 and no "92." The numeric output is 0–10, and the "revision" is a real consolidation step.

Résumé score
0–10, from a role rubric (e.g. AI_Engineer: Impact 50% / Technical 30% / Growth 20%).
Composite score
0–10. Consolidation reads the résumé score + the top interview traits, and integrates them, writing a gap_report ("what the résumé promised vs what the call confirmed / contradicted / exposed").
Coach interest_score
0–10, with should_follow_up and send_within_hours.

Source: CEF-AI/agent-catalog · résumé src/resume/schema.ts:5-11; consolidation src/analysis/consolidation.ts:14-46 + prompts.ts:4-24. Real fixtures: test/migrations/enums.test.ts:169 (prior_ai 7.5 / human 8.0 → composite 7.8); test/analysis/consolidation.test.ts (6 → 7). The demo's "7.5 → 6.8" is illustrative and labelled as such; the mechanism (a drop on a "contradicted/exposed" gap) is real.

Per-turn transcript with reading likelihood alert
Per-turn read: each candidate segment is tagged (e.g. "Reading likelihood .80 — alert" / "Clarity .63 — good"), layered over Transcription · Diarization · Delivery metrics. The corrected demo S3 mirrors this exactly.

Demo accuracy — what changed

Where the first demo drafted signals that did not match the code or the video, and what they are now.

ElementDemo first claimedCorrected to (real)
The read5×5 "signal matrix by interview round / relationship stage", 0–100delivery meters 0–1 + color-by + lenses/tags; no stage dimension exists
The catch"92 confidence", "8.0 → 6.5 reject"composite 0–10 (7.5 → 6.8 illustrative); verdict partial; reaffirm; advisory
Coach signalsdefensiveness/commitment/escalation "by stage"framework tags (Gottman/attachment/EFT) + between-the-lines + interest/follow-up/send-within-hours
Framing"hire / reject verdict""advisory delivery signal, not a verdict or proof of misconduct"

Sources

Everything above traces to one of these. Repos are private under the CEF-AI org.

Code · CEF-AI/agent-catalog
agents/hiring-intelligence, agents/dating-coach, packages/nlp-kit
Lenses
hiring-intelligence/src/lens/prompts.ts:4-35
Semantic verdict
src/lens/semantic.ts:9-18
Score / consolidation
src/analysis/consolidation.ts:14-46
Learning loop
src/learning/store.ts:4-96
Shared diarization
packages/nlp-kit/src/diarization/extract-signal.ts:1-11
Science registry
packages/nlp-kit/src/science/registry.ts:10-60
Coach verdict
dating-coach/src/analyze/schemas.ts:4-39
Live UI
Bren · "Hiring Coach (Lab)" recording, 2026-06-25
Brief
Teams Daily Sync, 2026-06-26 (Fred: 30s + 90s, recruiting-first swap to coach)