code
Code — 27 agents
TypeScript (core, React, Next.js, Fastify, Node.js, Vue, Angular), Python (Django, FastAPI, Flask, Celery), Go (core, Gin, gRPC), Rust (core, Axum, Tokio), Java (Spring Boot, Quarkus), Kotlin (Android, Ktor), Swift (iOS), and a polyglot generalist. Deep framework knowledge in each.
testing
Test — 16 agents
Vitest, Jest, Playwright, Cypress, pytest, unittest, Go table-driven, Rust cargo test, JUnit 5, Spring Boot Test, XCTest, Android instrumented tests, plus test-runner agents per language (Python, JS, Rust, Go, Java) that execute and validate tests.
review
Review — 5 agents
Multi-lens code review (correctness, style, performance, security, naming), accessibility auditing (WCAG 2.1), performance review, integration checks, and quality assurance.
security
Security — 4 agents
OWASP Top 10 scanning, API authentication auditing, dependency vulnerability scanning, and general SAST with CWE-referenced findings.
ops
Ops — 5 agents
Docker optimization, Kubernetes manifests, Terraform modules, CI/CD pipelines (GitHub Actions, GitLab CI), and general infrastructure investigation.
intelligence
Planning — 3 agents
AI product owner for requirement enrichment, task architect for DAG decomposition with expert assignment, and project-inception intake for new-project onboarding conversations.
architecture
Architecture — 3 agents
System design with ADRs, REST/GraphQL API architecture, and frontend architecture with component design and state management strategy.
data
Data — 4 agents
SQL optimization with EXPLAIN ANALYZE, zero-downtime migrations (expand-contract), schema design with ER modeling, and general data engineering.
performance
Performance — 4 agents
Frontend (Core Web Vitals, bundle optimization), backend (API latency, N+1 detection), database (query plans, indexing), and full-stack performance analysis.
docs
Docs — 3 agents
Technical writing, API documentation with OpenAPI specs, and architecture documentation with Mermaid diagrams and ADRs.
devex
DevEx — 4 agents
Linting setup (ESLint, ruff, clippy), build tooling (Vite, turbo, pnpm), project onboarding with CLAUDE.md generation, and general DX improvement.
benchmark
Benchmark — 1 agent
SWE-bench Verified runner — applies candidate patches to target repos and runs the official Princeton Docker eval harness for pass/fail scoring.