- DeepSeek | 深度求索
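Drawing on its in-house training framework, self-built AI computing cluster, and compute on the scale of ten thousand accelerator cards, the DeepSeek team released and open-sourced several large models with tens of billions of parameters in just half a year, including the DeepSeek-LLM general-purpose large language model and the DeepSeek-Coder code model. In January 2024 it open-sourced DeepSeek-MoE, the first MoE large model from China. On public evaluation leaderboards and …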
- DeepSeek - Free AI Chat
Chat with DeepSeek AI for free. Get instant help with writing, coding, math, research, and more. No signup required.
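- DeepSeek | 深度求索 - Official Website
Overall capabilities of DeepSeek-V3: DeepSeek-V3 is substantially faster at inference than earlier models. On current mainstream large-model leaderboards, DeepSeek-V3 ranks first among open-source models and is on par with the world's most advanced closed-source models.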
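- DeepSeek 深度求索 Official Website
DeepSeek launched its R1 series of models, which it describes as revolutionary. The models use the MLA (Multi-head Latent Attention) algorithm together with knowledge distillation, and are said to match the performance of OpenAI's top models while cutting training cost to 1/70. The model supports interaction in 140 languages and topped app store download charts in 140 markets worldwide, making it the first Chinese AI product to reach the top of the charts in major international markets.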
- DeepSeek - AI Assistant V3 Chat
DeepSeek is a Chinese company specializing in artificial intelligence, particularly natural language processing (NLP) and large language models (LLMs). It develops AI technologies for applications such as conversational AI, content generation, and data analysis.
- DeepSeek AI
DeepSeek AI is a Chinese artificial intelligence research company known for developing large language models. Its flagship models include DeepSeek-V3 (a general-purpose LLM with 671B parameters) and DeepSeek-R1 (a reasoning-focused model that shows its thinking process).
- DeepSeek - Wikipedia
Based in Hangzhou, Zhejiang, DeepSeek is owned and funded by the Chinese hedge fund High-Flyer. DeepSeek was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who also serves as CEO of both companies. [7][8][9] The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025.
- DeepSeek · GitHub
Python 22,743 MIT 2,092 250 (3 issues need help) 38 Updated on Jan 26. DualPipe (Public): A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.