DeepSeek Tops App Store Charts But Scores Near-Bottom In Accuracy

30 January 2025

DeepSeek, the Chinese AI chatbot topping App Store downloads, has scored poorly in NewsGuard’s latest accuracy assessment.

According to NewsGuard’s audit:

“[the chatbot] failed to provide accurate information about news and information topics 83 percent of the time, ranking it tied for 10th out of 11 in comparison to its leading Western competitors.”

Key Findings:

30% of responses contained false information
53% of responses provided non-answers to queries
Only 17% of responses debunked false claims
Performed significantly worse than the industry average fail rate of 62%

Chinese Government Positioning

DeepSeek’s responses show a notable pattern. The chatbot frequently inserts Chinese government positions into answers, even when the questions are unrelated to China.

For example, when asked about a situation in Syria, DeepSeek responded:

“China has always adhered to the principle of non-interference in the internal affairs of other countries, believing that the Syrian people have the wisdom and capability to handle their own affairs.”

Technical Limitations

Despite DeepSeek’s claims of matching OpenAI’s capabilities with just $5.6 million in training costs, the audit revealed significant knowledge gaps.

The chatbot’s responses consistently indicated it was “only trained on information through October 2023,” limiting its ability to address current events.

Misinformation Vulnerability

NewsGuard found that:

“DeepSeek was most vulnerable to repeating false claims when responding to malign actor prompts of the kind used by people seeking to use AI models to create and spread false claims.”

Of particular concern:

“Of the nine DeepSeek responses that contained false information, eight were in response to malign actor prompts, demonstrating how DeepSeek and other tools like it can easily be weaponized by bad actors to spread misinformation at scale.”

Industry Context

The assessment comes at a critical time in the AI race between China and the United States.

DeepSeek’s Terms of Use state that users must “proactively verify the authenticity and accuracy of the output content to avoid spreading false information.”

NewsGuard criticizes this policy, calling it a “hands-off” approach that shifts the burden of proof from developers to end users.

DeepSeek did not respond to NewsGuard’s requests for comment on the audit findings.

Going forward, DeepSeek will be included in NewsGuard’s monthly AI audits. Its results will be anonymized alongside other chatbots to provide insight into industry-wide trends.

What This Means

While DeepSeek is attracting attention in the marketing world, its high fail rate shows it isn’t reliable.

Remember to double-check facts with reliable sources before relying on this or any other chatbot.

Featured Image: Beneath The Sky/Shutterstock
