Qwen 3.5 122b is competitive with Opus 4.6, and runs at 35t/s on a Strix Halo. It is my daily driver.
Unlike Opus I can run abliterated models with censorship removed so it can be used for security research and reverse engineering and whatever I want with privacy, offline.
Unlike Opus I can run abliterated models with censorship removed so it can be used for security research and reverse engineering and whatever I want with privacy, offline.
It makes any hosted models feel like a kids toy.