MNBVC
M
MNBVC
Overview :
MNBVC (Massive Never-ending BT Vast Chinese corpus) is a project aimed at providing rich Chinese data for AI. It includes not only mainstream cultural content but also niche cultures and internet slang. The dataset encompasses various forms of pure text Chinese data, such as news, essays, novels, books, magazines, papers, dialogues, posts, wikis, ancient poems, lyrics, product descriptions, jokes, anecdotes, and chat logs.
Target Users :
Suitable for researchers in natural language processing, Chinese machine learning developers, and AI projects requiring large amounts of Chinese data.
Total Visits: 474.6M
Top Region: US(19.34%)
Website Views : 118.7K
Use Cases
Used for training Chinese chatbots
Supports Chinese text mining and sentiment analysis
Serves as a foundation for training Chinese natural language understanding models
Features
Provides a large-scale Chinese corpus
Supports natural language processing and machine learning research
Promotes the development of Chinese AI technology
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase