TL;DR
The team has released an open-source project to reproduce DeepSeek-R1, including datasets, training scripts, and evaluation tools. This allows researchers to replicate and build upon DeepSeek-R1’s capabilities.
Open Reproduction of DeepSeek-R1 has been publicly released, offering the AI research community access to the tools, datasets, and scripts needed to replicate the model and its training pipeline. This initiative aims to foster transparency and collaborative development in advanced language model research.
The project, hosted on GitHub, provides scripts for training, data generation, and evaluation, along with curated datasets such as Mixture-of-Thoughts, CodeForces-CoTs, and Math datasets distilled from DeepSeek-R1. The release follows a series of updates, including the completion of the first step—reproducing reasoning datasets—and details about the training environment, which requires CUDA 12.4 and specific hardware configurations. The repository is a work in progress, inviting contributions from the community to refine and expand the reproduction efforts. The team emphasizes that this open release aims to enable researchers to understand, evaluate, and improve upon DeepSeek-R1’s architecture and performance.
According to the project documentation, the repository contains scripts for training models with various techniques such as supervised fine-tuning (SFT) and group relative policy optimization (GRPO). It also includes data generation tools using distilled models, and instructions for setting up the environment on high-performance hardware, specifically with NVIDIA H100 GPUs. The project’s progress is marked by the release of datasets and the implementation of initial training and evaluation pipelines, with further steps planned to replicate the reinforcement learning pipeline used in DeepSeek-R1.
Implications for AI Research and Transparency
This open release represents a major step toward transparency and collaborative development in large language models. By providing the community with access to datasets, training scripts, and evaluation tools, it enables independent verification of DeepSeek-R1’s capabilities, fosters innovation, and accelerates progress in reasoning, coding, and mathematical tasks. Open Code Review can be a useful resource for reviewing the code and ensuring quality. It also sets a precedent for open-sourcing complex AI pipelines, which could influence future research standards and encourage more open collaboration across the AI community.

The AI Factory Handbook: Build, Manage, and Scale NVIDIA AI Infrastructure (NCA-AIIO Exam Prep & Real-World Operations)
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Background and Progress of the DeepSeek-R1 Reproduction Project
DeepSeek-R1 is a prominent language model known for its reasoning and coding capabilities, developed by DeepSeek. Prior to this open reproduction effort, access to the model’s training pipeline and datasets was limited, hindering independent validation and further development. The current initiative, announced in May 2025, aims to fill this gap by releasing a comprehensive set of tools and datasets, following earlier updates that included datasets like Mixture-of-Thoughts (May 2025), CodeForces-CoTs (March 2025), and Math datasets (February 2025). These datasets are distilled from R1 and are designed to improve reasoning and problem-solving abilities in language models. The project is modeled after the original pipeline but is still under active development, with ongoing efforts to fully replicate the reinforcement learning components used in DeepSeek-R1.
“Our goal is to enable the community to reproduce and build upon DeepSeek-R1, fostering transparency and collaborative progress.”
— DeepSeek AI Team
CUDA 12.4 compatible high-performance GPU
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Remaining Challenges in Full Reproduction
While datasets and training scripts are now available, it is not yet confirmed whether the full reinforcement learning pipeline, including policy optimization and fine-tuning stages used in DeepSeek-R1, has been fully replicated. Details about the exact training procedures, hyperparameters, and performance benchmarks are still emerging, and the community’s ability to reproduce the entire model remains a work in progress.

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering – 96GB DDR7 ECC Memory – 4th Gen RT/5th Gen Tensor Core GPU – OEM Packaging
[NVIDIA Blackwell Streaming Multiprocessor] The new SM features increased processing throughput, and new neural shaders that integrate neural…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Next Steps for Community-Driven Reproduction
Developers and researchers are expected to collaborate on refining the reproduction process, validate the datasets and training scripts, and attempt to replicate the full RL pipeline. Future updates may include benchmarking results, improved datasets, and expanded models. The project team plans to facilitate contributions, gather feedback, and document best practices to enable broader community participation in recreating DeepSeek-R1’s capabilities.

Hiwonder AI Robotic Arm Kit Imitation Learning VLA Model Development Embodied AI Open Source 6-Axis Full Metal Programming Robot Arm with Magnetic Encoder Bus Servos & Tutorials, NexArm Standard Kit
【Next-Gen Embodied AI Robotic Arm】Built on an ESP32 + AT32 dual-chip architecture, with an industrial-grade all-metal body and…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
What is included in the open reproduction release?
The release includes datasets distilled from DeepSeek-R1, training and data generation scripts, evaluation tools, and instructions for setup and training on high-performance hardware.
Can I fully replicate DeepSeek-R1 using this release?
Not yet. While datasets and training scripts are available, the full reinforcement learning pipeline used in the original model is still being reproduced and validated by the community.
What hardware is required to run these models?
The project recommends using NVIDIA H100 GPUs with CUDA 12.4, and provides setup instructions for high-performance training environments. For more insights into hardware requirements, see Nobody cracks open a programming book anymore.
How can I contribute to this project?
Community members can contribute by refining datasets, improving training scripts, and sharing results. The project repository encourages collaborative development and feedback.
Source: Hacker News