✕
  • News ▸

    • ◂ Back
    • Crypto News
      • Bitcoin News
      • Ethereum News
      • Cardano News
      • Stablecoin News
      • CBDC News
      • DeFi News
      Regulatory News
      • Regulation
      • Legal
      • Cybercrime
      Industry News
      • Exchanges
      • Mining
      • Data Service
      Technology News
      • Enterprise
      • Blockchain Application
  • Analysis
  • Interview
  • Wiki
  • Press Release
  • Price
  • Events
  • Opinion
  • Join Us
  • Tools
  • About
  • Contact
  • Privacy
  • Terms & Conditions
  • Advertise
  • 中文
  •      

DEEPSEEK

 

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

NVIDIA's GB200 NVL72 and Dynamo Enhance MoE Model Performance

Lawrence Jengar     Jun 06, 2025

#NVIDIA #MOE MODELS #AI #INFERENCE PERFORMANCE
NVIDIA Unveils Nemotron-H Reasoning Models for Enhanced Throughput

James Ding     Jun 06, 2025

#NVIDIA #AI MODELS #NEMOTRON-H #THROUGHPUT
NVIDIA Advances Speech AI with Cutting-Edge Parakeet and Canary Models

James Ding     Jun 05, 2025

#NVIDIA #SPEECH AI #ASR MODELS
FLUX.1 Kontext: Revolutionizing Image Editing with Multi-Modal Models

Peter Zhang     May 31, 2025

#IMAGE EDITING #AI MODELS #BLACK FOREST LABS
BNB Chain Advances AI Integration with Model Context Protocol (MCP)

Iris Coleman     May 22, 2025

#BNB CHAIN #AI INTEGRATION #MODEL CONTEXT PROTOCOL
NVIDIA Unveils AI Blueprint for Advanced Video Analytics

Rebeca Moen     May 19, 2025

#NVIDIA #AI #VIDEO ANALYTICS #VISION LANGUAGE MODELS
NVIDIA NeMo Enhances Hugging Face Model Integration with AutoModel Feature

Rebeca Moen     May 13, 2025

#NVIDIA #AI MODELS #HUGGING FACE #NEMO FRAMEWORK
NVIDIA Unveils NV-Tesseract Models to Revolutionize Time-Series Data Processing

Luisa Crawford     May 11, 2025

#NVIDIA #TIME-SERIES DATA #MACHINE LEARNING #AI MODELS
Arcee AI Transitions from AWS to Together AI for Enhanced Flexibility and Performance

Peter Zhang     May 06, 2025

#AI MODELS #CLOUD MIGRATION #ARCEE AI
NVIDIA Launches Secure AI General Availability with Enhanced Protection for Large Language Models

Joerg Hiller     Apr 30, 2025

#NVIDIA #AI SECURITY #LARGE LANGUAGE MODELS #CONFIDENTIAL COMPUTING



  • Write
  • About
  • Contact
  • Privacy
  • Terms & Condition
  • Advertise
Copyright © 2020 Blockchain News. All Rights Reserved.