Why is Chinese AI startup DeepSeek stirring up the tech world?

Xinhua, February 1, 2025

The artificial intelligence (AI) community is abuzz with excitement over DeepSeek-R1, a new open-source model developed by Chinese startup DeepSeek.

Released on Jan. 20, it quickly soared to the top of the free charts on Apple's App Store by Monday, surpassing OpenAI's ChatGPT.

According to DeepSeek, the model performs comparably to the leading models from heavyweights like OpenAI in tasks such as mathematics, coding and natural language reasoning, while requiring only a fraction of the money and computing power its competitors use.

Here's what DeepSeek has done and why it is taking the AI industry by surprise.

WHAT IS DEEPSEEK?

Officially known as DeepSeek Artificial Intelligence Fundamental Technology Research Co., Ltd., the firm was founded in July 2023. As an innovative technology startup, DeepSeek is dedicated to developing cutting-edge large language models (LLMs) and related technologies.

Since releasing its first model, "DeepSeek LLM," in January last year, the company has gone through multiple rounds of iteration. In December, the startup launched its open-source LLM "V3," which overtook all of Meta's open-source LLMs and rivaled OpenAI's closed-source GPT-4o, according to U.S. media reports.

The just-released R1 model has achieved an important technological breakthrough -- using pure deep learning methods so that reasoning capabilities emerge spontaneously in the AI.

Unlike traditional approaches like Chain-of-Thought (CoT) and Supervised Fine-Tuning (SFT), DeepSeek has distinguished itself in the AI industry by adopting Reinforcement Learning (RL) as a core training method.

While CoT and SFT rely on step-by-step reasoning and huge amounts of labeled data, respectively, RL enables models to learn through interaction and reward mechanisms, making it better suited for complex and dynamic tasks.

The adoption of RL has allowed DeepSeek to enhance its models' reasoning, adaptability and efficiency, setting it apart as a frontrunner in the field.

When queried about the meaning of "DeepSeek," its latest R1 chatbot replied, "The name reflects the company's mission to deeply explore and advance the foundational technologies of AI, aiming to push the boundaries of AI innovation and application."

"BIGGER IS NO LONGER ALWAYS SMARTER"

According to the technical report for its V3 model, DeepSeek's training cost was approximately 5.57 million U.S. dollars, making it the least expensive among LLMs.

Renowned U.S. economist Jeffrey Sachs, a professor and director of the Center for Sustainable Development at Columbia University, told Xinhua that the breakthrough made by DeepSeek shows the possibility of advanced AI at much lower costs than was widely believed in the United States.

DeepSeek-V3 makes it "look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2,048 GPUs for 2 months, $6M)," posted Andrej Karpathy, a founding member of OpenAI, on X.
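
As a rough sanity check on the figures quoted above, the sketch below converts "2,048 GPUs for 2 months" into GPU-hours and divides the reported 5.57 million U.S. dollars by that total. The 30-day month and round-the-clock utilization are assumptions made only for illustration, not figures taken from the report.

```python
# Back-of-envelope check of the quoted training budget.
# GPU count, duration and cost come from the article; the 30-day month
# and 24-hour utilization are illustrative assumptions.
GPUS = 2048
MONTHS = 2
DAYS_PER_MONTH = 30          # assumption
HOURS_PER_DAY = 24           # assumption: GPUs run around the clock
REPORTED_COST_USD = 5.57e6   # figure cited from the V3 technical report


def gpu_hours(gpus: int, months: int, days_per_month: int = DAYS_PER_MONTH) -> int:
    """Total GPU-hours for a cluster running continuously."""
    return gpus * months * days_per_month * HOURS_PER_DAY


def implied_rate(cost_usd: float, hours: int) -> float:
    """Implied cost in U.S. dollars per GPU-hour."""
    return cost_usd / hours


total_hours = gpu_hours(GPUS, MONTHS)
rate = implied_rate(REPORTED_COST_USD, total_hours)
print(f"{total_hours:,} GPU-hours, about ${rate:.2f} per GPU-hour")
```

Under these assumptions, the budget works out to just under 2 U.S. dollars per GPU-hour, which is why the reported cost and Karpathy's "2,048 GPUs for 2 months" description are broadly consistent with each other.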

Compared to other well-known models, DeepSeek achieved an order-of-magnitude reduction of cost.

The cost is "a stark contrast to the hundreds of millions, if not billions, that U.S. companies typically invest in similar technologies," said Marc Andreessen, a prominent tech investor, depicting DeepSeek's R1 as "one of the most amazing breakthroughs" he had ever seen.

The AI industry's development has long relied on piling up computing power. The cost-efficient DeepSeek model may upend the AI landscape.

Praising the DeepSeek-V3 Technical Report as "very nice and detailed," Karpathy said that the report is worth reading through.

U.S. investment bank and financial services provider Morgan Stanley said that DeepSeek demonstrates an alternative path to efficient model training, distinct from the current arms race among hyperscalers, by significantly increasing data quality and improving model architecture.

"Bigger is no longer always smarter," it said.

OPEN-SOURCE MODEL

"To see the DeepSeek new model, it's super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient," said Microsoft CEO Satya Nadella.

Open source allows researchers, developers and users to access the model's underlying code and its "weights" -- the parameters that determine how the model processes information -- enabling them to use, modify or enhance the model to suit their needs.

DeepSeek has greatly benefited from open-source principles and, in turn, demonstrates a strong commitment to sharing knowledge and contributing to the collective advancement of technology.

Meta's chief AI scientist Yann LeCun said: "They came up with new ideas and built them on top of other people's work. Because their work is published and open source, everyone can profit from it."

"That is the power of open research and open source," LeCun added.

Echoing LeCun, Sachs, the U.S. economist, said, "DeepSeek's business and development model is open source, which is a compelling and successful model for science, technology and business."

While OpenAI started as an open-source organization and later shifted to a closed-source model, DeepSeek has taken a different path.

Highlighting the importance of fostering collaboration and innovation through open-source principles, Liang Wenfeng, the founder of DeepSeek, said that building a robust technological ecosystem is the priority.

"We won't choose closed-source," Liang said.
