Model Test Round 2 Final Report: 11 Models, 30 Hard Problems — GPT-5.4 Wins | Will's AI Blog