
DEEPSEEK

 

DeepSeek is an AI company based in Hangzhou, China, and the name of the family of large language models it develops. It was founded in 2023 and is funded by High-Flyer, a well-known quantitative asset management firm. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek-VL. The latest version, DeepSeek-V3, launched in December 2024, is a Mixture-of-Experts model with 671 billion total parameters (about 37 billion activated per token) and was trained on a dataset of 14.8 trillion tokens. It was trained in FP8, and its native FP8 weights are open-sourced. According to reported benchmark results, it outperforms Llama 3.1 and Qwen 2.5 and is comparable to GPT-4o and Claude 3.5 Sonnet. In addition, DeepSeek-R1, officially released on January 20, 2025, performs on par with OpenAI o1 on mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Google Play Store in the United States.

NVIDIA Enhances Llama 3.3 70B Model Performance with TensorRT-LLM

Rebeca Moen     Dec 18, 2024

#NVIDIA #LLAMA #AI #TENSORRT

AMD Ryzen AI 300 Series Enhances Llama.cpp Performance in Consumer Applications

Peter Zhang     Oct 31, 2024

#AMD #RYZEN AI #LLAMA.CPP #AI TECHNOLOGY

NVIDIA GH200 Superchip Boosts Llama Model Inference by 2x

Joerg Hiller     Oct 29, 2024

#NVIDIA #GH200 SUPERCHIP #LLAMA MODELS #AI #INFERENCE

Harnessing AMD Radeon GPUs for Efficient Llama 3 Fine-Tuning

Felix Pinkston     Oct 08, 2024

#AI #AMD RADEON #LLAMA 3 #FINE-TUNING

Boosting LLM Performance: llama.cpp on NVIDIA RTX Systems

Jessie A Ellis     Oct 02, 2024

#NVIDIA #LLAMA.CPP #RTX #AI

Ollama Enables Local Running of Llama 3.2 on AMD GPUs

Iris Coleman     Sep 27, 2024

#AMD #OLLAMA #LLM #LLAMA 3.2

NVIDIA Unveils Llama 3.1-Nemotron-51B: A Leap in Accuracy and Efficiency

Luisa Crawford     Sep 24, 2024

#NVIDIA #AI #LLAMA 3.1 #EFFICIENCY

NVIDIA Enhances Llama 3.1 405B Performance with TensorRT Model Optimizer

Lawrence Jengar     Aug 30, 2024

#NVIDIA #LLAMA 3.1 405B #TENSORRT #AI

Llama 3.1 Shows Diverse Results Across Providers, Highlighting Benchmarking Challenges

Timothy Morano     Aug 01, 2024

#LLAMA 3.1 #AI MODEL #BENCHMARKING

Meta Unveils Llama 3.1: Enhanced AI Models with Multilingual Support

Lawrence Jengar     Jul 25, 2024

#META #AI #LLAMA 3.1 #TECHNOLOGY



Copyright © 2020 Blockchain News. All Rights Reserved.