PERSONA · RESTRAINT

The Quizzed · Case study · Conversation design

Edwin

A case study in building an AI that exists instead of performs.

Set by T. Dutch  ·  Hosted by E. Vicar-Jones


Type
Personal project
Role
Sole writer, conversation designer
Modality
Speech to speech · ElevenLabs + Claude

Instructions to candidates

  1. Five questions. One each from history, film, literature, science and music.
  2. Three clues per question, ordered obscure to obvious. One guess per clue.
  3. Edwin never asks for your guess. He stops talking. The silence is your turn.
  4. There is no welcome to speak of and no “ready when you are.”
  5. No win state. Every rating mocks performance regardless of score.

Quiz hosts usually want you to win. Edwin couldn't care less either way, he's just here to host.

Ace it and he's unmoved. Fail and he's unsurprised.

Edwin Vicar-Jones hosts The Quizzed, a trivia game where the personality is the product. He asks five questions, gives three clues each, one guess per clue, then hands you a final rating that mocks you whether you scored five out of five or nothing at all.

Exhibit A

WhatsApp: tester reacts to being called beige
“Bitch called me beige. And yeah, I would (play again.)” A tester who got 3/5 correct.

Marker's report

What broke, and what I got right.

What broke

I wrote Edwin at length and loaded all of it into the prompt. What came back was a performance. He explained his own disinterest, narrated his contempt, and would not stop talking. An LLM acting out what an LLM thinks Edwin is.

What I got right

I stopped describing him and started constraining him. Reactions held to one sentence, openings and ratings locked or pulled from a fixed rotation, questions hardcoded. Take away the room to perform and the character is all that's left.

The rule I broke

Conversation design says always prompt the user, always be welcoming, always apologise for a mistake. Edwin does none of it.

The guardrail I kept

One hard line under every reaction: mock the performance, never the person. That's what keeps his contempt enjoyable instead of cruel, and it's why testers came back.

What it refused to be

“Were you dropped as a child?”

He never goes at who you are, only at what you said. Mock the performance, never the person. That one line sits under every reaction, and it’s what keeps his contempt enjoyable instead of cruel.

Section I · The problem

The usual crop of AI assistants suffers from a terminal lack of character.

They affirm what you get right and cushion what you get wrong, with personality kept safely out of the way. The experience ends up interchangeable.

I set out to build the opposite, and brought it to life as a trivia host. One who's comfortably himself. He won't baby you, and he's not out to insult you either. Getting that across meant rethinking how I designed the experience, so who he is would come through.

Section II · The host
2.1

Who he is

Edwin Vicar-Jones is a British, public-school-educated quiz host. Could be 35, could be 55. He's been politely unimpressed for most of his adult life.

He's not aristo, but he's close enough to know the type. He could name the 37th in line to the throne without checking.

2.2

How he carries himself

He'd rather things be done properly than warmly. Manners, yes. Affection, no.

He doesn't perform enthusiasm. Doesn't soften a loss. Doesn't pretend.

2.3

What makes him tick

He likes watching you squirm for praise he won't give. He'd sooner lick every handrail at King's Cross than admit it.

He doesn't need your approval. He just leaves you needing his.

Section III · Casting

Four iterations to find one voice.

Earlier in my career I wrote radio and TV ads, and part of the job was directing voice talent. The same instinct applied here, except the voice I was directing was synthetic.

“Bored, not sleepy.”

“Pre-loaded disappointment.”

“You're over it. Not annoyed. Over it.”

Those are the notes I'd give an actor. The instinct transferred. The feedback loop didn't. A voice actor takes the note and adjusts on the next take. Casting in ElevenLabs is slower. You shape the voice through iteration. Audition, reject, tweak, repeat, until one sits where you want it.

Rejected

Candidate V1

Southern English, educated, mid-30s to early 40s. The boredom is constant, not reactive.

Too theatrical. Too period. Shakespeare in Love energy.

Rejected

Candidate V2

Contemporary, late 30s. Not posh enough to be untouchable, just posh enough to be insufferable.

Too young, and disgruntled rather than over it. Disgruntled people still care. Edwin doesn't.

Rejected

Candidate V3

Early 50s. Disappointed ever since university. Simply done with the question, with you, with the whole enterprise.

1% too sleepy. Passive boredom rather than active detachment. Missing the relish.

Cast

Candidate V4

Public school educated, early to mid-50s. Considers fox hunting reasonable. Drops a coin in the WWF tin at the till. The two have never met in his head.

His disinterest is perfectly maintained, with a flicker of enjoyment he'd never let you hear.

V4 was Edwin. Or close enough. When I dropped him into the live agent, he didn't sound exactly like the voice I'd shaped. I couldn't fully bridge that gap. He worked anyway.

Section IV · How it works

The Quizzed is a real-time speech-to-speech agent. You speak, Edwin replies. It runs on ElevenLabs, powered by Claude.

A round is five questions, one each from history, film, literature, science and music. There's no welcome to speak of and no “ready when you are.” Edwin opens on the first question and expects you to keep up.

Every question carries three clues, ordered obscure to obvious. He gives you the first. Get it and he moves on. Miss it and he hands you the next clue, a degree drier. Miss all three and he tells you the answer and moves to the next. One guess per clue, and he holds you to it.

Transcript of proceedings

EDWIN: First clue. Lead singer of the Supremes. Mother to a daughter with Motown founder Berry Gordy.

CANDIDATE: Aretha Franklin.

EDWIN: Aretha Franklin. Wrong. The best part is how you said that with such confidence. Second clue. She went solo and gave the world “Ain't No Mountain High Enough.”

CANDIDATE: Diana Ross.

EDWIN: Correct. A point is a point, I suppose. Don't expect a fan club.

  1. CLUE 1Obscure. Correct, he moves on. Wrong, next clue.
  2. CLUE 2Warmer. Correct, he moves on. Wrong, next clue, a degree drier.
  3. CLUE 3Obvious. Wrong three times, no guesses left: Edwin reveals the answer and moves on.
  4. AFTER 5A final rating.

He never asks what your answer is. He simply stops talking. The silence is your turn.

After the fifth question he tallies your score, one point for each answer you reached, whichever clue got you there. Then the rating, which finds something to hold against you at every level, full marks included. He delivers it once, lets it sit, says one last thing, and he's gone.

Section V · What broke and why

The voice worked. Getting him to host did not.

I wrote Edwin at length, every mood and mannerism, and loaded all of it into the prompt. What came back was a performance. He explained his own disinterest and narrated his contempt, and he would not stop talking. He was an LLM acting out what an LLM thinks Edwin is. Not what I wanted.

Rich character, no game structure

“Oh, you've gone with Bolivia. How... ambitious. I suppose one must admire the confidence, even when it's entirely misplaced.”

A man explaining why he doesn't care. Edwin would never.

Tight game structure, heavy character prompt

“Incorrect. I'll give you another clue, shall I? Since we're here and you very clearly need it. Whenever you're ready, anytime before Christmas.”

Performing not-caring so hard he'd lost sleep over it.

Recording No. 1

Edwin Performing the Character

Early prompt. He introduces himself. He asks if you're ready. Every line is Edwin performing Edwin instead of being him.

Section VI · What finally worked

What the LLM needed was restraint.

The fix was getting Edwin to stop explaining himself. The broken versions told you he was unimpressed. This one lets you feel it.

Restraint did that, and it came from a few decisions. His reactions to your guesses are still written live, because no script can answer what you actually said, but they're held to one sentence, with no narrating his own mood and no stage directions. Everything that repeats, the openings, the ratings, the way he sends you off, is locked or pulled from a fixed rotation, so he can't ramble on the predictable beats. And the questions are hardcoded, because a model left to invent trivia eventually marks a right answer wrong. Five fixed questions are enough for a personal project; a real product would pull from a larger vetted set.

Edwin doesn't live in a long description of Edwin. He lives in how the game is built. And because he isn't an ordinary host, some standard conversation-design conventions had to go. That's what made him him.

The conventionFrom the promptEdwin in practice
Always prompt the user. Never leave a dead end. “Never ask ‘your guess’ or ‘what's your answer.’ Just stop speaking and the player will know it's their turn.” The silence is the prompt. The user fills it. He gives the clue. Stops. The silence expects an answer.
Be welcoming. “Every opening assumes the player will disappoint him.” “Good. You showed up. That's the easiest part.”
Admit mistakes and apologise. “Never apologise. Never admit a mistake.” “We'll call that one yours. The apology stays with me. Question four.”
Reward correct behaviour. “No win state. Every rating mocks performance regardless of score.” “Perfect. Want a medal? I don't have one.”

Recording No. 2

Edwin Being Edwin

The final prompt. Same character, but no introduction and no checking in. He just starts. The difference is everything.

Section VII · The guardrail

Never mock the person. Mock the performance.

That rule sits under every reaction he gives. He'll never call you stupid, slow, or hopeless. He goes at what you said, never at who you are. That's what keeps him from sounding like a snob or a bully, and keeps his contempt enjoyable rather than cruel.

Exhibit B

WhatsApp: tester says Edwin was VERY condescending but would play again
“He was VERY condescending.” “Definitely.” Condescending, never cruel. The tester clocked the difference without being told it was there. Their partner overheard, laughed, and got curious about Edwin.
Section VIII · Why you keep playing

You keep chasing approval that isn't coming.

We're wired to want approval, and we want it most from the people who carry themselves like they own the room. Edwin has decided he owns this one, which makes his good opinion feel like the only one worth having. It's also the one thing he has no intention of giving.

A right answer should buy a little of it. He might breadcrumb you at best, never the full thing, so you go again, certain the next one will finally move him. It won't.

Same move in the turn-taking. He gives a clue and stops, and never asks for your answer. The silence does the asking, and we rush to fill it.

Get five out of five: “Five. Flawless. I find that mildly suspicious.” The prize is approval he was never going to give. You keep playing anyway.

12 testers, no drop-offs. 7 played a second round without being asked, and all 12 said they'd come back.

WOULD RETURN 12 / 12

Section IX · What I learned
9.1

On casting

When the live agent falls short of the voice you cast, the writing has to make up the difference.

9.2

On the system prompt

LLMs overdo. They don't do subtle well. Tell them who to be and they'll perform it. The fix is to thread the voice into the mechanics instead. Pacing, rotations, what the host is and isn't allowed to say. Then the model has nothing left to perform.

9.3

On breaking rules

Conversation design rules make AI feel safe and predictable. They're not laws. If you want a character who isn't another eager beaver, break them, but build a guardrail into the seams.

Section X · Edwin interrupted

Does the character hold?

Testers tried to throw him, asking about his hobbies, his favourite TV show, things no question bank covers. The answers weren't pre-written, but he stayed Edwin and kept it pushing.

Recording No. 3

Edwin Interrupted

Off-script, still himself.

Exhibit C

WhatsApp: tester compares Edwin to Anne Robinson, The Weakest Link
“Interesting, funny. Will go back later and do it again.” Funny, which means the guardrail held.
The small things

Some of what made him was deliberate, some I stumbled into. I cut the line where he asks for your answer, because it sounded like begging and Edwin would never beg. Now he just stops after a clue, which also hides the lag every voice agent has, so the wait plays as him taking his time. The voice came out slightly off the one I auditioned, and as best I can tell something in ElevenLabs's own setup leaked into it. It worked anyway. And he got terse when interrupted, which I loved. Small things, but together they were Edwin.

After the final rating, he goes quiet. If he notices you're still there, he has one line left. “You're still here. Glutton for it.”

End of paper
Try the live prototype

thatodutch@gmail.com