LLaVA-3b
L
Llava 3b
Overview :
LLaVA-3b is a model fine-tuned based on Dolphin 2.6 Phi, using the SigLIP 400M visual tower in an LLaVA manner. The model features multiple image labels and outputs from the latest layer of the visual encoder. This model is based on Phi-2 and is subject to the Microsoft Research license, prohibiting commercial use. Thanks to ML Collective for providing computational resource credits.
Target Users :
LLaVA-3b can be used in applications such as image description generation and visual question answering.
Total Visits: 29.7M
Top Region: US(17.94%)
Website Views : 54.1K
Features
Model Fine-tuning
Model Deployment
Usage in Transformers
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase