Vmax is a San Francisco reinforcement-learning startup (founded 2025 by three RL/robotics PhDs from UCL and UPenn) that automates the conversion of proprietary data and evals into RL environments for LLM-based agents, targeting long-horizon and coding tasks. Its public research includes unix-ctf, a procedural generator of capture-the-flag tasks for Unix/shell competence.
How we verified this
Re-verified the draft against vmax.ai, the company's Greenhouse, LinkedIn, South Park Commons, PitchBook, the unix-ctf arXiv paper, and the withmartian ARES post. Confirmed this is the CORRECT entity (SF RL-environments startup, vmax.ai) matching the 'research-grade RL environments' note, and ruled out two same-named decoys: the V-Max autonomous-driving framework (arXiv:2503.08388, valeoai) and a Shenzhen EV-charging 'VMAX' (Tracxn, founded 2005). Confirmed: what_they_sell=environments, 8 SF open roles, ~11 employees / band 1-10, unix-ctf paper (arXiv:2605.29115) with Vmax-affiliated authors, and investors Race Capital + South Park Commons (kept 'reported'). Corrected the founder set to THREE co-founders (added Heejin Jeong, UPenn), the draft listed only two and misattributed all to UCL. Corrected the customer attribution from 'Harbor / Laude Institute' to Martian/ARES (a collaboration, not a paying customer; remains self-claimed). Removed 'execution infrastructure' from focus_areas as unsupported. All funding amount/round/valuation correctly remain unknown; no SOC2/certifications/security page found. Overall confidence: medium."