Inside the labs racing to make reasoning models run on your phone
A new generation of sub-3B-parameter models is closing the gap with frontier systems — and they're doing it on silicon that fits in your pocket. We spent three weeks with the teams redesigning inference from the chip up.