DeepSeek unveiled a compact version of the R1 AI model to run on a single GPU

Author: NaKmo Flow | 5/30/2025

DeepSeek unveiled a compact version of the R1 AI model to run on a single GPU

Chinese company DeepSeek has released a new, lightweight version of its R1 language model, designated DeepSeek-R1-0528-Qwen3-8B. Unlike the original R1, the new product can efficiently run only one GPU, making it more affordable for individual developers and small teams.


The emphasis is on logic and reasoning

The new model is based on Qwen3-8B from Alibaba and is optimized for tasks requiring analysis, logical inference and reasoning. In particular, it performs well in profile tests:

  • AIME 2025
  • HMMT (Harvard-MIT Math Tournament)


Model Characteristics

ParameterValue
NameNameDeepSeek-R1-0528-Qwen3-8B
ArchitectureQwen3-8B (from Alibaba)
GPU SupportSingle GPUOne GPU
Major specializationMathematics, logic, reasoning
LicenseMIT (open, including commercial use)
Distribution PlatformHugging Face
Language PolicyModerately censored, with mild disclaimers


More open but cautious

The updated version of R1 has changed its approach to moderation: previously the model could directly avoid answering politically sensitive questions (e.g. about the Tiananmen Square events or comparing leaders to pop culture characters). Now the model simply politely evades answering with the phrase:

"Sorry, that's beyond my current scope. Let's talk about something else."

This change seems to be made to balance transparency, security and accessibility in an international context.


Who is this model for?

The compact version of the DeepSeek R1 can be particularly useful:

  • Researchers and students working with logic and math
  • Startups that can't afford to run GPT-4 scale models
  • Developers implementing AI in on-premises solutions without cloud connectivity


The model is already available for download and use at Hugging Face.

Recommended for you

Carnegie Mellon is grooming the next generation of data and AI leaders

Everything Google announced at I/O 2025: AI is advancing on all fronts

OpenAI implements GPT-4.1 in ChatGPT to improve programming