AI Research2026-06-20MIT Technology Review

Startup Claims Breakthrough in LLM Bottleneck

A Miami-based AI startup called Subquadratic has emerged from stealth mode claiming to have solved a fundamental mathematical bottleneck that has plagued large language models for nearly a decade. The company asserts that its breakthrough could dramatically improve the efficiency of LLMs, potentially reducing computational costs and energy consumption by orders of magnitude. The bottleneck in question relates to the quadratic complexity of attention mechanisms, which are central to transformer-based models like GPT-4 and Llama. As models grow larger, the computational cost of processing long sequences increases exponentially, limiting context windows and requiring massive hardware resources. Subquadratic claims to have developed a novel mathematical framework that reduces this complexity to linear or near-linear levels without sacrificing model quality. If true, this would allow LLMs to handle much longer contexts—potentially millions of tokens—while running on far less powerful hardware. However, details remain scarce. The company has not released a technical paper, open-sourced its code, or provided independent benchmarks. Skepticism is widespread among AI researchers, many of whom have seen similar claims fail to materialize. The field has a history of promising breakthroughs that turn out to be incremental improvements or theoretical ideas that don’t scale in practice. Subquadratic’s CEO has stated that the company is in talks with major cloud providers and chip manufacturers to license its technology. The startup has raised a modest seed round from undisclosed investors. For now, the AI community is watching closely but cautiously. If Subquadratic’s claims hold up under scrutiny, it could reshape the economics of AI development, making advanced models accessible to smaller organizations and reducing the environmental impact of training and inference. Until then, the burden of proof remains on the startup to demonstrate real-world results.

Related news