Deedy

@deedydas

7/16/2025, 2:43:52 AM

Google DeepMind just dropped this new LLM model architecture called Mixture-of-Recursions.

It gets 2x inference speed, reduced training FLOPs and ~50% reduced KV cache memory. Really interesting read.

Has potential to be a Transformers killer. https://t.co/LdrKmSy6tR

Source: https://t.co/w9LyT6WdMW

Explore

TwitterXDownload

v1.2.1

The fastest and most reliable Twitter video downloader. Free to use, no registration required.