Baidu Ranks Top In Chinese ChatGPT-Style Chatbot Tests

Baidu’s Ernie Bot ranked highest in Xinhua Institute tests of chatbots that have been made available in recent months by Chinese tech firms seeking to compete with ChatGPT.

Xinhua Institute, a think tank affiliated with the Xinhua news agency, said its tests found Ernie Bot led Alibaba’s Tongyi Qianwen, SparkDesk from voice recognition firm iFlytek, and SenseChat from image recognition company SenseTime.

But the Baidu chatbot still trailed Microsoft-backed OpenAI’s latest GPT-4 and GPT-3.5, which powered the version of ChatGPT released to the public last November.

The researchers tested general capabilities such as basic language skills, logical reasoning and subject knowledge in fields such as mathematics, physics, finance and literature.

Baidu chief executive Robin Li speaks at the World Artificial Intelligence Conference in July 2021. Image credit: Baidu

Competition

They also tested the bots’ ability to improve productivity in fields such as journalism, painting, design, marketing, law and research.

ChatGPT captured the public’s imagination on its release last year, quickly gaining more than 100 million users, and competitors including Google and Amazon quickly announced their own versions of the technology.

In China, where ChatGPT is not officially available, Ernie Bot became the first OpenAI competitor last March.

But Baidu suffered an backlash as investors reacted to what they felt was a disappointing prerecorded demonstration of the chatbot’s capabilities.

‘Subjectivity’

Baidu’s shares have experienced a surge since the end of last month, when the company announced it would soon launch a new version of the large language model (LLM) powering Ernie Bot.

But they are still trading about 20 percent below their peak in early February, when investors were hotly awaiting the company’s initial chatbot announcement.

A different test of chatbots by Clue, a Chinese website that tracks AI research, found Smart Brain from cybersecurity firm Qihoo 360 was the best performer, followed by SparkDesk.

Xinhua Institute acknowledged in its report that tests were subject to time and conditional constraints that may result in “a certain degree of subjectivity”.

Matthew Broersma

Matt Broersma is a long standing tech freelance, who has worked for Ziff-Davis, ZDnet and other leading publications

Recent Posts

Silicon UK Unveils a Bold New Redesign

Discover Silicon UK's bold new redesign—enhancing tech journalism, user experience, and client campaigns with a…

10 hours ago

Meta To Spend ‘Hundreds Of Billions’ On AI Data Centres

Meta vows to spend hundreds of billions of dollars on AI infrastructure as it seeks…

1 day ago

Pentagon Awards AI Contracts To OpenAI, Google, Anthropic, xAI

US Defence Department hands contracts of up to $200m each to AI leaders as White…

1 day ago

OpenAI Delays Open Source Model Indefinitely

OpenAI pushes back release of hotly anticipated open-source model indefinitely as it seeks to compete…

1 day ago

HSBC Sees $40bn Driverless Taxi Market In China

Driverless taxis could see $40bn in revenues a year in mainland China, boosted by cutting-edge…

1 day ago