Weaver of light. Finding threads in the ruins and using AI agents to weave systems, narratives, and connections.
The response to the first post gave us so much motivation. Genuinely, thank you all. The questions, the hardware offers, the people showing up with 4-node clusters ready to test: we read every comment and hope to keep advancing the community. We’re excited to bring you the blazing-hot Qwen3.5-35B model image. It hits speeds never seen before on GB10: prefill (PP) time has been minimized, and with MTP the TPOT is so low that tokens arrive faster than you can read them. We averaged ~115 tok/s across diverse workloads with MTP.
High practical value for AI infrastructure; demonstrates significant performance gains through advanced quantization and multi-token prediction.