
Kodwai is a coding challenge platform by kodwai.com that lets you solve real-world problems on your machine using AI agents like Claude Code, Cursor, and Codex. Compete on leaderboards, build your pro
Kodwai is a coding challenge platform where developers solve real-world problems on their own machine using AI agents like Claude Code, Cursor, or Codex. Instead of testing memorized algorithms, it scores how well you direct an agent—catching hallucinations, verifying outputs, and shipping real code. The platform ranks you on a public leaderboard based on your ability to manage AI-driven development. It’s designed to measure the skill that matters in modern engineering: directing an agent effectively, not just passing tests.
AI agent skill assessment
Developers test their ability to prompt, verify, and recover from agent errors in real coding tasks.
Team hiring or internal evaluation
Engineering teams can use Kodwai to evaluate candidates or current staff on agent-directed problem-solving.
Competitive coding with AI
Compete on leaderboards by solving challenges with your own agent, comparing scores on direction, outcome, and lift.
Skill benchmarking
See how your agent-handling skills rank against other developers in a public, transparent way.
Learning and improvement
Review per-signal evidence (direction, outcome, lift) to understand why you scored the way you did and improve.
Real-world workflow simulation
Work on ticket-sized problems in your own editor, with no sandbox constraints, mimicking how you actually build software.
Bring your own agent
Use Claude Code, Cursor, or Codex—no proprietary AI required; you work with the agent you already use.
Five-step challenge flow
Pick a challenge, run a CLI command, solve on your machine, submit with one command, and get scored.
CLI-based submission
Run `$npx @kodwai/cli submit` to package code, git history, test runs, agent transcript, and time for scoring.
Three-score evaluation
You are scored on Direction (how well you direct the agent), Outcome (what shipped), and Lift (improvement over baseline).
Per-signal evidence
Each score axis lands with specific evidence so you can see why you scored that way.
Public leaderboard
Rankings are visible to all, showing how your agent-handling skill compares.
Real-world problems
Challenges are ticket-sized, covering categories you actually ship in, with difficulty filters.
No sandbox constraints
Work in your own editor and terminal with your own agent—no artificial limitations.
Git history tracking
The CLI inits a git repo and tracks your session, including agent transcript and test runs.
Free to start
Challenges are fully free; you bring your own agent (Claude Code, Cursor, or Codex).
Kodwai is built for developers who work with AI agents daily—engineers who want to prove their skill at directing agents, not just memorizing LeetCode patterns. It’s also useful for engineering managers hiring for modern roles, and for teams evaluating how well members handle agent-driven development. The platform targets anyone who believes the real skill is in catching agent errors, writing specs, and shipping verified code.
$npx @kodwai/cli challenge to download PROBLEM.md, starter files, and tests. Choose your agent (Claude Code, Cursor, or Codex).$npx @kodwai/cli submit to package your code, git history, test runs, agent transcript, and time for scoring.Kodwai is fully free to start. The website states: "Start a challenge→fully free/bring your own agent/claude code, cursor or codex." There is no mention of paid tiers or trial limitations in the provided text.
Kodwai addresses a genuine gap in developer assessment—LeetCode-style tests don’t measure how well you handle AI agents in real workflows. The three-score system (Direction, Outcome, Lift) with per-signal evidence provides transparent, actionable feedback, which is rare in coding challenge platforms. The CLI-based flow is straightforward and respects how developers actually work, avoiding sandbox friction. However, the platform’s value depends entirely on the quality and relevance of its challenge library, which isn’t detailed in the text. For developers already using AI agents daily, Kodwai offers a practical, competitive way to benchmark a skill that’s increasingly central to modern engineering.
Kodwai is a coding challenge platform by kodwai.com that lets you solve real-world problems on your machine using AI agents like Claude Code, Cursor, and Codex. Compete on leaderboards, build your pro
Category:Code generation
Visit Link:https://www.kodwai.com/
Tags:AI coding challenges、real-world coding problems、AI agent competition、developer leaderboard