3FS
3
3FS
Overview :
3FS is a high-performance distributed file system designed for AI training and inference workloads. Leveraging modern SSDs and RDMA networks, it provides a shared storage layer, simplifying distributed application development. Its core advantages include high performance, strong consistency, and support for various workloads, significantly improving AI development and deployment efficiency. This system is suitable for large-scale AI projects, particularly excelling in data preparation, training, and inference stages.
Target Users :
3FS is ideal for AI developers and research teams requiring high-performance storage solutions, especially those handling massive datasets and complex model training. It significantly improves data processing efficiency and reduces development and deployment costs.
Total Visits: 492.1M
Top Region: US(19.34%)
Website Views : 54.9K
Use Cases
In large-scale AI training, 3FS is used to store and quickly access training data, significantly improving training speed.
During the inference stage, 3FS's KVCache function provides efficient caching support for LLM inference, reducing computational overhead.
3FS is used in the data preparation stage to efficiently manage intermediate outputs of the data pipeline, optimizing the data processing workflow.
Features
High Performance: Combines the throughput of thousands of SSDs and the network bandwidth of hundreds of storage nodes to support large-scale parallel access.
Strong Consistency: Employs the CRAQ protocol to ensure strong data consistency, simplifying application development.
Support for Multiple Workloads: Suitable for scenarios such as data preparation, training sample loading, checkpoint saving, and inference caching.
Ease of Use: Provides standard file interfaces, eliminating the need to learn new storage APIs.
High Scalability: Supports large-scale cluster deployments to meet the needs of AI projects of various sizes.
How to Use
1. Clone the 3FS repository from GitHub: `git clone https://github.com/deepseek-ai/3fs`.
2. Initialize submodules: `cd 3fs && git submodule update --init --recursive`.
3. Install dependencies such as CMake, libuv, liblz4, etc. (see documentation for specific dependencies).
4. Build 3FS: `cmake -S . -B build -DCMAKE_BUILD_TYPE=RelWithDebInfo`, then run `cmake --build build`.
5. Deploy a test cluster, configuring storage nodes and clients according to the deployment guide.
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase