
DEEPSEEK

 

DeepSeek is an AI company based in Hangzhou, China, and the name of the family of large language models it develops. It was founded in 2023 and is funded by High-Flyer, a well-known quantitative asset management firm. The company is dedicated to developing advanced large language models and related technologies, and it has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek-VL. DeepSeek-V3, launched in December 2024, has 671 billion total parameters (about 37 billion activated per token) and was trained on a dataset of 14.8 trillion tokens. It was trained in FP8, and its native FP8 weights are open-sourced. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT-4o and Claude 3.5 Sonnet. In addition, DeepSeek-R1, officially released on January 20, 2025, performs on par with OpenAI o1 on mathematics, code, and natural language reasoning tasks.

DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, the models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Google Play Store in the United States.

NVIDIA Unveils Llama Nemotron Super v1.5 for Enhanced AI Efficiency

Caroline Bishop     Jul 26, 2025

#NVIDIA #AI #LLAMA NEMOTRON #TECH INNOVATION

NVIDIA's NeMo Framework Enables Weekend Training of Reasoning-Capable LLMs

Lawrence Jengar     Jul 23, 2025

#NVIDIA #AI #LLM #NEMO

NVIDIA NeMo-RL Utilizes GRPO for Advanced Reinforcement Learning

Peter Zhang     Jul 10, 2025

#NVIDIA #REINFORCEMENT LEARNING #NEMO-RL #GRPO

Building Custom AI Agents with NVIDIA NeMo Agent Toolkit

Timothy Morano     Jul 02, 2025

#AI AGENTS #NVIDIA NEMO #OPEN SOURCE

NVIDIA's Llama 3.2 NeMo Retriever Enhances Multimodal RAG Pipelines

Joerg Hiller     Jul 01, 2025

#NVIDIA #MULTIMODAL RAG #NEMO RETRIEVER #AI MODELS

Enhancing Custom Information Retrieval with Fine-Tuned Embedding Models

Luisa Crawford     Jun 26, 2025

#INFORMATION RETRIEVAL #EMBEDDING MODELS #NVIDIA NEMO CURATOR #DATA CURATION

Enhancing LLM Workflows with NVIDIA NeMo-Skills

Caroline Bishop     Jun 25, 2025

#NVIDIA #NEMO-SKILLS #LLM #AI

NVIDIA Unveils Nemotron-H Reasoning Models for Enhanced Throughput

James Ding     Jun 06, 2025

#NVIDIA #AI MODELS #NEMOTRON-H #THROUGHPUT

NVIDIA Enhances Long-Context LLM Training with NeMo Framework Innovations

Peter Zhang     Jun 03, 2025

#NVIDIA #LLM #NEMO FRAMEWORK #AI

NVIDIA NeMo Guardrails Enhance LLM Streaming for Safer AI Interactions

Jessie A Ellis     May 23, 2025

#NVIDIA #NEMO GUARDRAILS #LLM STREAMING #AI SAFETY



Copyright © 2020 Blockchain News. All Rights Reserved.