InternVL
I
Internvl
Overview :
InternVL, by extending the ViT model to 6 billion parameters and aligning with the language model, has constructed the largest open-source visual basic model currently available, a 14B model, which has achieved state-of-the-art performance in a wide range of tasks including visual perception, cross-modal retrieval, and multimodal dialogue, with 32 published papers demonstrating its excellence.
Target Users :
["Computer Vision Research","Multimodal Application Development"]
Total Visits: 474.6M
Top Region: US(19.34%)
Website Views : 148.8K
Use Cases
Using InternViT-6B for image classification
Using InternVL-C for image text retrieval
Using InternVL-Chat for visual question answering
Features
Image Classification
Semantic Segmentation
Video Classification
Image Text retrieval
Vision-Language Modeling
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase