Miami-based startup Subquadratic has launched a model with a 12-million-token context window, far larger than the context windows of existing frontier models. Its architecture uses Selective Attention to achieve linear scaling of compute and memory with context length, and the model outperforms OpenAI's models on key retrieval benchmarks at lower cost. The company aims to expand the context window to 50 million tokens in the near future.
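
To give a sense of why linear scaling matters at this size, the rough sketch below compares how many query-key pairs dense attention scores (quadratic in context length) with a scheme in which each token attends to a fixed budget of keys (linear in context length). It does not describe how Selective Attention works internally. The function names and the per-query budget `K` are hypothetical, chosen only for illustration.

```python
def full_attention_pairs(n_tokens: int) -> int:
    """Query-key pairs scored by dense attention: every token attends to every token."""
    return n_tokens * n_tokens


def budgeted_attention_pairs(n_tokens: int, keys_per_query: int) -> int:
    """Pairs scored when each query attends to at most a fixed number of keys."""
    return n_tokens * min(keys_per_query, n_tokens)


if __name__ == "__main__":
    K = 4096  # hypothetical per-query key budget, not a published figure
    for n in (128_000, 1_000_000, 12_000_000, 50_000_000):
        dense = full_attention_pairs(n)
        budgeted = budgeted_attention_pairs(n, K)
        print(
            f"{n:>11,} tokens: dense={dense:.2e} pairs, "
            f"budgeted={budgeted:.2e} pairs, ratio={dense / budgeted:,.0f}x"
        )
```

Under these assumptions, doubling the context length doubles the budgeted count but quadruples the dense count, so the gap between the two widens as the window grows from 12 million toward 50 million tokens.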