

Light R1 14B DS
Overview :
Light-R1-14B-DS is an open-source mathematical model developed by Qihoo 360 Technology Co., Ltd. Trained using reinforcement learning based on DeepSeek-R1-Distill-Qwen-14B, it achieved high scores of 74.0 and 60.2 on the AIME24 and AIME25 mathematics competition benchmarks, respectively, surpassing many 32B parameter models. It successfully implemented reinforcement learning on an already long-chain reasoning fine-tuned model under a lightweight budget, providing the open-source community with a powerful mathematical model tool. Its open-source nature promotes the application of natural language processing in education, particularly in mathematical problem-solving, offering researchers and developers valuable research foundations and practical tools.
Target Users :
This model is suitable for researchers and developers in natural language processing, especially those focusing on mathematical problem-solving, educational applications, and reinforcement learning. It provides an excellent reference for teams aiming for high-performance model training on a lightweight budget, enabling quick adoption and research and development.
Use Cases
Researchers can utilize this model to research and improve mathematical problem-solving algorithms.
Developers can build educational applications based on this model to help students better solve mathematical problems.
Businesses can apply this model to intelligent customer service systems to improve the ability to answer math-related questions.
Features
Reinforcement learning-based long-chain reasoning training enhances mathematical problem-solving capabilities.
Open-source model facilitates secondary development and research by researchers and developers.
Excellent performance in mathematical benchmark tests such as AIME24 and AIME25, with high accuracy.
Supports efficient training under a lightweight budget, reducing computational costs.
Provides detailed training logs and technical reports for easy understanding and reproducibility.
How to Use
1. Visit the Hugging Face website and locate the Light-R1-14B-DS model page.
2. Download the model files and related resources, including training logs and technical reports.
3. Load the model using a supported framework, such as PyTorch or TensorFlow.
4. Fine-tune the model or apply it directly to mathematical problem-solving tasks based on specific needs.
5. Refer to the technical report and training logs to understand the model's training process and optimization methods for better use and improvement.
Featured AI Tools

Gemini
Gemini is the latest generation of AI system developed by Google DeepMind. It excels in multimodal reasoning, enabling seamless interaction between text, images, videos, audio, and code. Gemini surpasses previous models in language understanding, reasoning, mathematics, programming, and other fields, becoming one of the most powerful AI systems to date. It comes in three different scales to meet various needs from edge computing to cloud computing. Gemini can be widely applied in creative design, writing assistance, question answering, code generation, and more.
AI Model
11.4M
Chinese Picks

Liblibai
LiblibAI is a leading Chinese AI creative platform offering powerful AI creative tools to help creators bring their imagination to life. The platform provides a vast library of free AI creative models, allowing users to search and utilize these models for image, text, and audio creations. Users can also train their own AI models on the platform. Focused on the diverse needs of creators, LiblibAI is committed to creating inclusive conditions and serving the creative industry, ensuring that everyone can enjoy the joy of creation.
AI Model
6.9M