✕
  • News ▸

    • ◂ Back
    • Crypto News
      • Bitcoin News
      • Ethereum News
      • Cardano News
      • Stablecoin News
      • CBDC News
      • DeFi News
      Regulatory News
      • Regulation
      • Legal
      • Cybercrime
      Industry News
      • Exchanges
      • Mining
      • Data Service
      Technology News
      • Enterprise
      • Blockchain Application
  • Analysis
  • Interview
  • Wiki
  • Press Release
  • Price
  • Events
  • Opinion
  • Join Us
  • Tools
  • About
  • Contact
  • Privacy
  • Terms & Conditions
  • Advertise
  • 中文
  •      

DEEPSEEK

 

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

Efficient AI Pipelines: NVIDIA's NeMo Retriever Extraction on a Single GPU

Lawrence Jengar     Jun 19, 2025

#NVIDIA #AI #MULTIMODAL EXTRACTION #GPU
NVIDIA's ITMonitron Revolutionizes Real-Time IT Incident Detection

Felix Pinkston     Jun 19, 2025

#NVIDIA #AI #ITMONITRON #INCIDENT DETECTION
NVIDIA Enhances Multi-GPU Communication with NCCL 2.26 Release

Darius Baruo     Jun 19, 2025

#NVIDIA #NCCL #GPU #AI #HPC
NVIDIA's GB200 NVL72 Boosts AI Model Evaluation at UC Berkeley's LMArena

Tony Kim     Jun 18, 2025

#NVIDIA #AI #LMARENA #GB200 NVL72
NVIDIA Advances ML in Manufacturing with CUDA-X Data Science

Felix Pinkston     Jun 18, 2025

#NVIDIA #MACHINE LEARNING #CUDA-X #MANUFACTURING
NVIDIA Launches G-Assist Plug-In Hackathon for AI Enthusiasts

Felix Pinkston     Jun 18, 2025

#NVIDIA #AI #HACKATHON #G-ASSIST
NVIDIA Enhances LLMOps for Efficient Model Evaluation and Optimization

Rongchai Wang     Jun 17, 2025

#NVIDIA #LLMOPS #AI #MACHINE LEARNING
NVIDIA's Project G-Assist: Building Twitch-Integrated Plug-ins

Peter Zhang     Jun 17, 2025

#NVIDIA #AI #TWITCH #PLUG-INS
NVIDIA Unveils AI Reference Apps for Real-Time Media Effects on Holoscan

Iris Coleman     Jun 17, 2025

#NVIDIA #AI #HOLOSCAN #MEDIA
NVIDIA Research Advances 3D Robot Perception with New AI-Based Models

Darius Baruo     Jun 17, 2025

#NVIDIA #AI #3D PERCEPTION #ROBOTICS



  • Write
  • About
  • Contact
  • Privacy
  • Terms & Condition
  • Advertise
Copyright © 2020 Blockchain News. All Rights Reserved.