@whitewhoadie: 🚨 NVIDIA JUST DROPPED A 0.6B CPU-ONLY SPEECH RECOGNITION MODEL Nemotron-3.5-ASR delivers real-time streaming ASR across 40+ languages — and it runs entirely on CPU. Ships with: • Only 0.6B parameters • Real-time streaming output • 2.5x faster than official Nemo runtime (same accuracy) • Fully offline capable • Easy integration into local agent pipelines Another strong small model for on-device/local AI stacks. 👉 https://huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b Who’s running local speech recognition today? Drop a 🔥