Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

The battle between AI chatbots is more than just a two-horse race. anthropic, a company founded by several former OpenAI employees, claims that its new Claude 3 language model outperforms ChatGPT and Google’s Gemini in several key industry benchmarks. He even reached “near human” levels in some positions, the company wrote on the blog.

There are three new chatbots under the Claude 3 umbrella, including Haiku, Sonnet, and Opus. The sonnet is empowering chatbot and is offered free with email access. Meanwhile, Opus is the largest and most powerful LLM and will be available with a monthly subscription of $20 through the “Claude Pro” service. It’s also multi-modal, so unlike past versions it can handle both text and image inputs.

All Claude 3 models “can power live customer chats, autocompletes and data extraction tasks where responses need to be immediate and in real-time,” he said. In addition to promising “instant results”, they can handle longer, multi-step instructions with increased accuracy.

Anthropic says its new Claude 3 AI chatbot outperformed GPT-4 on key benchmarks.Anthropic says its new Claude 3 AI chatbot outperformed GPT-4 on key benchmarks.


Opus showed better graduate-level reasoning than GPT-4, scoring 14.7 percent higher than GPT-4 on this test. It also outperformed OpenAI’s chatbot in tasks involving math, coding, reasoning and knowledge.

They also outperform previous Claude models. “For the vast majority of workloads, Sonnet is twice as fast as the highly intelligent Claude 2 and Claude 2.1. It excels in tasks that require fast response, such as knowledge retrieval or sales automation. Opus delivers similar speeds to Claude 2 and 2.1, but with higher levels of intelligence,” according to Anthropic.

Meanwhile, the Haiku, the smallest version of the Claude 3, is “the fastest and most affordable model on the market.” To that end, it can read a dense research paper full of charts and graphs in less than three seconds.

The company also noted that Claude 3 “can handle a wide variety of visual formats, including images, charts, graphs, and technical diagrams,” which helps companies using PDFs, flowcharts, or presentation slides. Also, in addition to recognizing “real harm”, a more nuanced understanding of queries will make it less likely to reject harmless content.

Anthropic said that Claude is an AI leads 10 hidden pillars of justice. Claude has been trained on both non-public internal and public data using 3 Amazon Web Services (AWS) and Google Cloud (Amazon) hardware. recently invested $4 billion in Anthropic).

Claude 3 Opus and Claude 3 Sonnet are now available through Anthropic’s API, and Haiku will follow soon. Sonnet is also available through Amazon Bedrock and on private preview Google Cloud’s Vertex AI Model Garden.

This article contains affiliate links; we may earn a commission if you click on such a link and make a purchase.

Source link

Leave a Reply

Your email address will not be published. Required fields are marked *