Claude 3.5 Sonnet outperforms previous models on SWE-bench Verified, achieving a 49% score. Learn about the enhancements and the agent framework enabling this advancement.
Claude 3.5 Sonnet Elevates Performance on SWE-bench Verified