Augment Code

@augmentcode

12:02 AM · Apr 15, 2025

𝐆𝐏𝐓-𝟒.𝟏 𝐚𝐥𝐦𝐨𝐬𝐭 𝐭𝐨𝐩𝐬 𝐂𝐥𝐚𝐮𝐝𝐞 𝟑.𝟕 𝐨𝐧 𝐜𝐨𝐝𝐢𝐧𝐠?!

New eval dropping using our #1 SWE-bench coding agent!

- GPT-4.1 beats Gemini 2.5 Pro and almost tops Claude 
   3.7 Sonnet!
- Even GPT-4.1 mini matches Claude 3.5 Sonnet V2 
   performance. It was the top model just 2mo ago!

The evaluation is done through our proprietary codebase understanding benchmark AugmentQA. You can learn more at: https://www.augmentcode.com/blog/you-make-your-evals-then-your-evals-make-you-introducing-augmentqa

Try our agent yourself at: http://www.augmentcode.com.

共有

探検する

TwitterXDownload

v1.4.62

Download Twitter videos and media content for free. No registration required. Fast and easy Twitter video downloader. Twitter Media Saver. Twitter X Download.

その他のリンク

関連商品

English 简体中文繁體中文 हिन्दी Español Français Deutsch বাংলা Русский Português اردو 日本語 한국어 Tiếng Việt Italiano ไทย Türkçe

© 2024 TwitterXDownload 無断転載を禁じます。

support@twitterxdownload.com