Chinese Large Language Models Lag Behind Foreign Rivals in Capabilities Assessment

A recent assessment conducted by Tsinghua University revealed that Chinese large language models (LLMs) Baidu’s Ernie Bot 4.0 and Zhipu AI’s GLM-4 rank high among domestic models, but fall short compared to foreign counterparts in overall capabilities. The SuperBench assessment found that overseas models, such as OpenAI’s GPT-4 and Anthropic’s Claude-3, demonstrated superior performance in semantic comprehension, coding abilities, and alignment with human commands. The report highlights the need for Chinese LLMs to bridge gaps in code-writing and operative abilities in real-world settings.

Scroll to Top