Qwen 3.5 122b is competitive with Opus 4.6, and runs at 35t/s on a Strix Halo. I...

Qwen 3.5 122b is competitive with Opus 4.6, and runs at 35t/s on a Strix Halo. It is my daily driver.

Unlike Opus I can run abliterated models with censorship removed so it can be used for security research and reverse engineering and whatever I want with privacy, offline.

It makes any hosted models feel like a kids toy.