✕
  • News ▸

    • ◂ Back
    • Crypto News
      • Bitcoin News
      • Ethereum News
      • Cardano News
      • Stablecoin News
      • CBDC News
      • DeFi News
      Regulatory News
      • Regulation
      • Legal
      • Cybercrime
      Industry News
      • Exchanges
      • Mining
      • Data Service
      Technology News
      • Enterprise
      • Blockchain Application
  • Analysis
  • Interview
  • Wiki
  • Press Release
  • Price
  • Events
  • Opinion
  • Join Us
  • Tools
  • About
  • Contact
  • Privacy
  • Terms & Conditions
  • Advertise
  • 中文
  •      

DEEPSEEK

 

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

NVIDIA Enhances TensorRT-LLM with KV Cache Optimization Features

Zach Anderson     Jan 17, 2025

#NVIDIA #TENSORRT-LLM #KV CACHE #LARGE LANGUAGE MODELS
NVIDIA TensorRT-LLM Enhances Encoder-Decoder Models with In-Flight Batching

Peter Zhang     Dec 12, 2024

#NVIDIA #TENSORRT-LLM #ENCODER-DECODER MODELS #AI
NVIDIA's TensorRT-LLM Multiblock Attention Enhances AI Inference on HGX H200

Caroline Bishop     Nov 22, 2024

#NVIDIA #AI INFERENCE #TENSORRT-LLM #HGX H200
NVIDIA NIM Revolutionizes AI Model Deployment with Optimized Microservices

Alvin Lang     Nov 22, 2024

#NVIDIA NIM #AI DEPLOYMENT #TENSORRT-LLM
NVIDIA's TensorRT-LLM Enhances AI Efficiency with KV Cache Early Reuse

Ted Hisokawa     Nov 09, 2024

#NVIDIA #AI #TENSORRT-LLM #TECH INNOVATION
Enhancing Large Language Models with NVIDIA Triton and TensorRT-LLM on Kubernetes

Iris Coleman     Oct 23, 2024

#NVIDIA TRITON #TENSORRT-LLM #KUBERNETES #LARGE LANGUAGE MODELS
NVIDIA TensorRT-LLM Boosts Hebrew LLM Performance

Felix Pinkston     Aug 07, 2024

#NVIDIA #TENSORRT-LLM #HEBREW LLM #AI
NVIDIA H100 GPUs and TensorRT-LLM Achieve Breakthrough Performance for Mixtral 8x7B

Terrill Dicki     Jul 03, 2024

#NVIDIA #TENSORRT-LLM #H100 GPUS #MIXTRAL 8X7B



  • Write
  • About
  • Contact
  • Privacy
  • Terms & Condition
  • Advertise
Copyright © 2020 Blockchain News. All Rights Reserved.