✕
  • News ▸

    • ◂ Back
    • Crypto News
      • Bitcoin News
      • Ethereum News
      • Cardano News
      • Stablecoin News
      • CBDC News
      • DeFi News
      Regulatory News
      • Regulation
      • Legal
      • Cybercrime
      Industry News
      • Exchanges
      • Mining
      • Data Service
      Technology News
      • Enterprise
      • Blockchain Application
  • Analysis
  • Interview
  • Wiki
  • Press Release
  • Price
  • Events
  • Opinion
  • Join Us
  • Tools
  • About
  • Contact
  • Privacy
  • Terms & Conditions
  • Advertise
  • 中文
  •      

DEEPSEEK

 

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

NVIDIA NIM Microservices Revolutionize Scientific Literature Reviews

Jessie A Ellis     Feb 26, 2025

#NVIDIA #AI #SCIENTIFIC RESEARCH #LLMS
Efficient Meeting Summaries with LLMs Using Python

Alvin Lang     Feb 22, 2025

#LLMS #PYTHON #MEETING SUMMARIES #AI
Exploring the Impact of LLM Integration on Conversation Intelligence Platforms

Joerg Hiller     Jan 10, 2025

#LLMS #CONVERSATION INTELLIGENCE #AI TECHNOLOGY
Enhancing LLMs for Domain-Specific Multi-Turn Conversations

Alvin Lang     Nov 26, 2024

#LLMS #AI #CONVERSATIONAL AI
Exploring Model Merging Techniques for Large Language Models (LLMs)

Jessie A Ellis     Oct 29, 2024

#MODEL MERGING #LLMS #ARTIFICIAL INTELLIGENCE
Innovative LoLCATs Method Enhances LLM Efficiency and Quality

Ted Hisokawa     Oct 15, 2024

#LLMS #AI DEVELOPMENT #LOLCATS
Llama 3.1 405B Achieves 1.5x Throughput Boost with NVIDIA H200 GPUs and NVLink

Peter Zhang     Oct 11, 2024

#NVIDIA #AI #LLMS #GPUS
NVIDIA GH200 NVL32: Revolutionizing Time-to-First-Token Performance with NVLink Switch

Peter Zhang     Sep 27, 2024

#NVIDIA #AI #TIME-TO-FIRST-TOKEN #LLMS
AI21 Labs Unveils Jamba 1.5 LLMs with Hybrid Architecture for Enhanced Reasoning

Jessie A Ellis     Aug 23, 2024

#AI #LLMS #NVIDIA #HYBRID ARCHITECTURE
Anyscale Explores Direct Preference Optimization Using Synthetic Data

Felix Pinkston     Aug 22, 2024

#DIRECT PREFERENCE OPTIMIZATION #SYNTHETIC DATA #ANYSCALE #LLMS



  • Write
  • About
  • Contact
  • Privacy
  • Terms & Condition
  • Advertise
Copyright © 2020 Blockchain News. All Rights Reserved.