🚀 Agents crush SWE Bench hard + Polyglot benchmark problems