Anthropic’s newest Claude chatbot beats OpenAI’s GPT-4o in some benchmarks


anthropic rolled his newest AI language model Thursday, Claude 3.5 Sonnet. The updated chatbot outperforms the company’s previous high-end model, the Claude 3 Opus, while running at twice the speed. Claude users (including those on free accounts) can check out starting today.

The Sonnet, which tends to be Anthropic’s most balanced model, is the first release in the Claude 3.5 family. The company says the Claude 3.5 Haiku (the fastest in any generation) and the Claude 3.5 Opus (the most powerful) will arrive later this year. (Those models will remain on version 3 in the meantime.) The Sonnet update is coming just a few months later. Arrival of the Claude 3 family, AI companies are trying to expose the latest and greatest.

Chart showing benchmark comparisons between the latest AI chatbot models: Claude 3.5 Sonnet, Claude 3 Opus, GPT-4o, Gemini 1.5 Pro and Llama-400b.Chart showing benchmark comparisons between the latest AI chatbot models: Claude 3.5 Sonnet, Claude 3 Opus, GPT-4o, Gemini 1.5 Pro and Llama-400b.

anthropic

Anthropic Claims Claude 3.5 The Sonnet is a step forward in understanding nuance, humor, and complex allusions, and he is able to write in a more natural tone. Benchmarks (above) show the new model breaking industry records for graduate-level thinking, undergraduate-level knowledge and coding ability. It beats OpenAI’s GPT-4o On many criteria published by Anthropic. However, the latest Claude, ChatGPT, Twins and Llama the models score within a few percentage points of each other in most tests, highlighting the close competition.

The company claims that the Claude 3.5 Sonnet is better at interpreting visual input than the Claude 3.0 Opus. Anthropic says the new model can “accurately transcribe text from imperfect images,” a capability it hopes will appeal to customers in retail, logistics and financial services who need information from charts, graphs and other visual cues.

Claude’s update also brings a new workspace the company calls Artifacts (above). When you ask the chatbot to create content such as code, text documents, or web design, a special window appears to the right of the conversation. From there, you can prompt Claude to make changes, and he’ll update the Artifacts window with the latest output.

The company sees Artifacts as the first step toward making Cloud a space for broader team collaboration. “In the near future, teams and eventually entire organizations will be able to securely centralize their knowledge, documents, and work-in-progress in one shared space, with Claude serving as an on-demand teammate,” the company wrote in a press release. .

Claude 3.5 Sonnet is now available to anyone with an account his websiteas well as Claude iOS app. (On both of those platforms, Claude Pro and Team subscribers get a higher token count.) You can also access via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. It’s $3 for one million input tokens and $15 for one million output tokens – the same as the previous model.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *