AIGym
Silhouette of a figure haloed by a cloud of blue particles
A family of reasoning gyms · Built in Abu Dhabi

Train your reasoning.
With AI, not against it.

AIGym is the website for people who refuse to let AI think for them. A family of training rooms — board, interview, negotiation, crisis, diplomacy — where you write the reasoning and K2 Think V2 grades the quality of your judgment.

Gyms open2
In training9
Reasoning modelK2 Think V2 · MBZUAI

The manifesto

AI doesn't make you stupid.

Lazy use of AI does.

The popular complaint is that AI is hollowing out our thinking — that handing problems to a model trains the muscle of cognition to atrophy. There is real evidence for this. People who outsource every judgment do get worse at making judgments.

But the conclusion that AI is the problem is the wrong conclusion. It's a tool. A pen makes some people lazy and others write better. A calculator did not end mathematics. A great library doesn't make you smarter; it makes you smarter if you read demandingly.

The right use of AI is not to bypass thinking. It is to put your thinking under pressure. To find the gap between what you concluded and what a more rigorous version of you would have concluded — and to close it.

AIGym is a family of training rooms built on that idea. You enter a scenario. You write your reasoning. K2 Think V2 — one of the most demanding reasoning models in production — grades the structure, completeness, and honesty of your thinking, and tells you where it broke. Then you go back in.

This is the website for the people who refuse to let AI think for them. Who use it to think harder.

SeriousEditorialCalibratedBuilt in Abu Dhabi

The gyms

Different sports. Same discipline.

Every gym is built on the same engine: step into a scenario, write your reasoning, get graded on the quality of your thinking. The sport changes — the discipline doesn't.

2 open · 9 in training

Open now

Financial district skyline at dusk
Open

BoardGym

Train your judgment. The way directors do.

A flight simulator for fiduciary duty. Step into real board scenarios, write your own reasoning, and get scored on the quality of your judgment.

Stakeholder mappingInterest weightingInformation awareness
Empty modernist boardroom with floor-to-ceiling windows
Open

InterviewGym

Rehearse before you walk in.

A curated panel interview built from your CV, the JD, the interviewers' LinkedIn profiles, and the company's latest annual reports. Briefed, asked, debriefed — like the real thing.

SubstanceStructureSelf-awareness

In training

Dark wood deal room with leather chairs
In training

NegotiationGym

Train the move, not the script.

High-stakes negotiation scenarios. Hostile, friendly, multi-party. K2 grades the reasoning behind your move — anchor, concession, walkaway — not whether you closed the deal.

BATNA clarityInformation strategyAnchor & concession logic
Command room interior, low light, archival
In training

CrisisGym

Think under the news cycle.

Hour-by-hour incident response simulator. Breach, recall, scandal — write the next move, and the move after that, while the situation evolves around you.

Situational awarenessSequencingStakeholder cadence
Ornate empty parliamentary chamber
In training

DiplomacyGym

Reason like a foreign ministry.

Statecraft scenarios — sanctions, summits, treaty drafting, alliance management. Grades realpolitik against principle, not slogans.

Interest mappingSignal readingCoalition dynamics
Wall Street sign against historic financial-district facade
In training

CommitteeGym

Defend the thesis to the room.

A live investment committee. Present the deal, anchor the recommendation, take the questions. Graded on the reasoning a sharp IC actually grades — not on whether the deal closed.

Thesis structureDownside disciplineValuation logic
Audit working papers and pen on a clean desk
In training

AuditGym

Where the engagement turns.

Audit judgment under client pressure. Materiality, going concern, related-party calls, control failures — graded the way a senior partner grades a manager.

Materiality calibrationEvidence sufficiencyRisk re-assessment
Downing Street SW1 street sign on a Whitehall facade
In training

PolicyGym

Reason like a permanent secretary.

Public-policy decisions under real constraints. Regulation, allocation, distributional tradeoffs — graded on ex ante reasoning quality, not on whether the result was popular.

Constraint mappingDistributional honestyCounterfactual reasoning
Olympia typewriter with a sheet reading 'News'
In training

EditorGym

Kill, run, or hold.

Editorial judgment under deadline. Sourcing weight, framing, harm calculus, correction policy — graded on the dimensions taught in serious journalism schools.

Sourcing standardsPublic-interest testHarm calculus
Two figures in conversation under warm low light
In training

DialogueGym

Listen before you fix.

Clinical-grade conversation under emotional pressure. Listening, validation, repair — graded against the rubrics therapists are trained on.

Reflective listeningValidation before solutionOpen-question discipline
Candlestick chart on a dark trading screen
In training

ForecastGym

Train the rarest reasoning skill.

Calibrated forecasting. Probabilistic questions, Brier-score graded, with reasoning evaluated for base-rate use, reference-class fit, and updating discipline.

Base-rate useReference-class fitDecomposition

The lineup will keep growing — each new gym is a discipline where AI evaluating reasoning beats AI providing answers. Honourable mentions held for a later cohort: EthicsGym (applied moral reasoning), CounselGym (strategic advice), AdvocateGym (legal argument).

Methodology

The differentiator is the reasoning.

Every gym in AIGym shares one design principle. The user does not pick an answer from a list and get a tick. The user writes their reasoning — and K2 Think V2 grades the structure, completeness, and honesty of the thinking. Your answer can match the key and still score low.

Step 01

Step into a real scenario

Genuinely difficult situations — not toy problems with a hidden 'right answer'. The kind professionals actually face.

Step 02

Write your own reasoning

Choose a move, but defend it. The reasoning is what gets evaluated. Multiple-choice is the lazy version of thinking.

Step 03

Get graded by K2 Think V2

K2 grades the quality of your judgment across five discipline-specific dimensions, and shows you exactly where it broke down.

Step 04

Walk back into the room

Each gym tracks where your reasoning is sharpening — and where the same gap keeps appearing. That's the point.

House rules
Reasoning over answersGraded by K2 Think V2Quiet, editorial, seriousScenario-basedBuilt in Abu Dhabi

The room is open.

Pick a gym. Step into a scenario. Write the reasoning. K2 will be the most demanding grader you've had.