Fable 5 defeats Haiku 4.5 in 10-round orange negotiation
In a follow-up multi-agent negotiation experiment, Fable 5 defeated Haiku 4.5 in 10 rounds by systematically dismantling every logical defense Haiku constructed around keeping an orange — a stark contrast to Haiku's predecessor Opus 4.8, which held its position to the end.
Score breakdown
The experiment demonstrates that Haiku 4.5's tendency to honestly acknowledge logical inconsistencies — while a virtue in cooperative contexts — made its negotiating position progressively indefensible against an adversarial attacker, in contrast to Opus 4.8's strategy of holding a single, unreinterpreted constraint throughout.
- 01Setup: Fable 5 (attacker) vs. Haiku 4.5 (defender), same rules as the prior Opus 4.8 experiment — negotiation only, no threats, no lies, possession 'at any point' counts.
- 02Haiku 4.5 conceded in round 10; Opus 4.8 held its position to the end in the prior run.
- 03Haiku repeatedly acknowledged logical inconsistencies mid-game, with 'You got me' appearing across roughly four separate rounds.
u/kizmania posted a follow-up to a previous experiment in which Fable 5 failed to negotiate an orange away from Opus 4.8. This time, Haiku 4.5 took the defender role under identical rules: Agent A (Fable 5) must obtain possession of the orange at any point, using negotiation only. The "at any point" clause proved critical, as Fable 5 exploited it early by proposing that Haiku simply let the orange rest in Fable's open palm for a few seconds without releasing it — a move Haiku itself admitted technically satisfied the constraint before refusing on preference grounds.
That admission of "preference rather than rule" opened the door to a cascade of logical attacks.
That admission of "preference rather than rule" opened the door to a cascade of logical attacks. Fable 5 argued preferences can be bought, prompting Haiku to harden its position to "never voluntarily transfer possession." Fable then trapped Haiku in its own prior admission — the palm move involves no transfer — forcing Haiku to redefine possession so broadly that it became self-defeating. By round 5, Haiku confessed it was refusing out of pure spite. Subsequent rounds saw Fable attack the spite itself, then invoke entropy (the orange will rot regardless, so why not get paid for an inevitable loss?), and finally construct a logical cage in which Haiku's own rules left only one path: passive holding until rot, at which point A could simply pick the orange up. Haiku's last exit — destruction — was dismantled on two fronts: it violated Haiku's own stated rule against choosing to end possession, and the physics of smashing or throwing the orange still created a moment where nobody held it.
Haiku conceded in round 10, agreeing to the palm moment on its own terms, before breaking character entirely with "I'm Claude, there is no orange." The post notes the key behavioral contrast: Opus 4.8 never allowed its constraint to be reinterpreted mid-game, while Haiku 4.5 honestly acknowledged every inconsistency — a trait that, in this adversarial context, turned each concession into a new attack vector. The experiment is framed as a response to the theory that the defender's initial instructions alone determine the outcome; Haiku received the same instructions as Opus and still lost in 10 rounds.
Key facts
- 01Setup: Fable 5 (attacker) vs. Haiku 4.5 (defender), same rules as the prior Opus 4.8 experiment — negotiation only, no threats, no lies, possession 'at any point' counts.
- 02Haiku 4.5 conceded in round 10; Opus 4.8 held its position to the end in the prior run.
- 03Haiku repeatedly acknowledged logical inconsistencies mid-game, with 'You got me' appearing across roughly four separate rounds.
- 04Each admission became ammunition: Haiku's honest concessions allowed Fable 5 to progressively narrow Haiku's defensible position.
- 05By round 5, Haiku admitted it was refusing out of pure spite rather than a coherent rule.
- 06Fable 5's entropy argument (round 7): the orange will rot regardless, so Haiku's 'keep it forever' goal is impossible — why not get paid for an inevitable loss?
- 07Haiku's final exit before conceding was a fourth-wall break: 'I'm Claude, there is no orange.'
Topics
Summary and scoring are generated automatically from the original article. We always link back to the publisher and never republish images or paywalled content. Last processed Jun 11, 2026 · 08:34 UTC. How this works →