# Visual
Fresh Picks

Minicpm O
MiniCPM-o 2.6 is the latest multimodal large language model (MLLM) developed by the OpenBMB team, featuring 8 billion parameters and capable of high-quality visual, voice, and multimodal interactions on edge devices like smartphones. This model is built on SigLip-400M, Whisper-medium-300M, ChatTTS-200M, and Qwen2.5-7B, trained in an end-to-end manner, and performs comparably to GPT-4o-202405. Its main advantages include leading visual capabilities, advanced voice functionality, powerful multimodal streaming abilities, impressive OCR performance, and superior efficiency. The model is open-source and free to use for academic research and commercial purposes.
AI Model
63.8K

Qwen VL
Qwen-VL is a general-purpose visual language model launched by Alibaba Cloud. It has powerful visual understanding and multimodal reasoning capabilities. The model supports zero-shot image description, visual question answering, text understanding, image landmark localization, and other tasks, achieving or exceeding the current state-of-the-art performance in multiple visual benchmark tests. Qwen-VL employs a Transformer architecture, pre-trained with a scale of 7B parameters, and supports 448x448 resolution for end-to-end processing of multimodal input and output between images and text. The model's advantages include its strong generality, multilingual support, and fine-grained understanding. It can be widely applied in tasks such as image understanding, visual question answering, image annotation, and text-to-image generation.
AI image detection and recognition
157.0K

Moondream
moondream is a 1.6 billion parameter model built using the SigLIP, Phi-1.5, and LLaVA training datasets. Due to the use of the LLaVA dataset, the weights are protected by the CC-BY-SA license. You can try it out on Huggingface Spaces. The model's performance on the VQAv2, GQA, VizWiz, and TextVQA benchmark tests is as follows:
LLaVA-1.5 (13.3B parameters): 80.0, 63.3, 53.6, 61.3
LLaVA-1.5 (7.3B parameters): 78.5, 62.0, 50.0, 58.2
MC-LLaVA-3B (3B parameters): 64.2, 49.6, 24.9, 38.6
LLaVA-Phi (3B parameters): 71.4, -, 35.9, 48.6
moondream1 (1.6B parameters): 74.3, 56.3, 30.3, 39.8.
AI Model
62.4K

Appweaver
AppWeaver is a no-code development platform that enables non-technical users to quickly construct web applications through a drag-and-drop component interface. It offers a wide range of visual components, allowing developers to create various applications such as web apps, mobile apps, and administration backends without coding. Key advantages include: 1) No coding required, reducing the entry barriers for development; 2) Powerful functionalities and a rich variety of components; 3) Multi-platform development support, with a single codebase capable of generating web, iOS, and Android apps. Priced at a free and premium version, the free version provides basic functionalities suitable for learning and small-scale application development. The premium version offers more advanced components, features, and technical support. AppWeaver is positioned as an easy-to-use visual development platform for non-technical users.
Development & Tools
63.5K

Internvl
InternVL, by extending the ViT model to 6 billion parameters and aligning with the language model, has constructed the largest open-source visual basic model currently available, a 14B model, which has achieved state-of-the-art performance in a wide range of tasks including visual perception, cross-modal retrieval, and multimodal dialogue, with 32 published papers demonstrating its excellence.
AI Model
147.7K

Alpha Sender
Alpha Sender is an all-in-one email marketing tool that combines intelligence, a drag-and-drop email editor, and campaign engagement analysis to help businesses achieve higher conversion rates through email marketing. It offers personalized emails, a drag-and-drop email editor, engagement analytics, and advanced marketing features like pop-ups and embedded forms, allowing businesses to capitalize on growth opportunities and boost sales performance.
Sales
48.3K

NEX
NEX is a media technology company developing controllable AI models for visual expression. We believe technology will help us pursue great stories. Our mission is to enhance humanity's storytelling capabilities.
AI design tools
47.2K

Blenny AI
Blenny AI is an AI-powered visual assistance tool that helps users capture screenshots and perform intelligent analysis of web pages. Users can leverage screenshots to quickly access AI summaries, translations, and webpage navigation. Additionally, Blenny AI supports custom AI agents, providing personalized services tailored to user needs. Blenny AI is powered by GPT-4V.
AI design tools
51.1K

Moji AI
Moji AI - Chat & Content AI is your ultimate all-in-one app to master the art of content creation, powered by advanced artificial intelligence. Moji AI simplifies and elevates your writing and content management experience, becoming an indispensable tool for professionals and creative individuals alike. Key features include an AI writing assistant, email writing templates, text-to-image generation, and an Instagram engagement calculator. Moji AI - Chat & Content AI Pro plan: Monthly: $9.99, Annual: $89.99. Compatible with iPhone, iPad, and Mac, with English language support.
AI content generation
61.0K

Windframe
Windframe is an AI-enhanced visual Tailwind builder and editor that enables you to rapidly prototype and build stunning web pages. Accelerate your web development workflow with minute-level delivery.
AI design tools
84.2K

Telesite
Telesite is a free website creation tool that utilizes artificial intelligence to generate a complete website in just a few seconds based on text and images. It is ideal for users who need to build a website quickly, as it doesn't require programming or design skills. With Telesite, you can create sleek and visually appealing websites that are mobile-optimized, all without any coding or design experience. The platform offers a variety of templates and a drag-and-drop visual builder, making it extremely user-friendly. Moreover, Telesite integrates seamlessly with Telegram bots, providing them with web-based presence. Overall, it is a powerful and easy-to-use free website builder for quick and efficient site development.
Website Generation
104.9K

Quill News Digest
Quill News Digest is an unconventional daily news digest application. It offers the most important stories through visual summaries and easy-to-read collections. Featuring images, maps, and quotes. Spend less time catching up on the latest happenings and more time enjoying what you do! We curate the most important stories from a variety of sources on the internet, providing you with concise, unbiased, and swift reading. Each collection includes a "quick" read, an expanded summary, images, quotes, relevant locations, and the option to read the full article from your chosen source. Quill publishes a digest every morning at 8:00 AM, allowing you to scan the most important stories on the internet. With Quill, you can ensure you stay up-to-date without being overwhelmed by excessive biased information.
AI News
51.3K

Buildship
BuildShip is a low-code, visual tool that uses AI technology to quickly build application backends, workflows, APIs, scheduled tasks, and serverless functions. It supports connecting pre-built nodes or generating custom nodes. Its AI models and tools enable the easy construction of multimodal workflows. You can integrate any AI model and tool into your workflows. BuildShip also offers templates and customization features, supporting the generation of common functions like HTML to PDF conversion and Stripe scheduled reports. It supports serverless API deployment and scheduled task execution, along with advanced development tools like version control, debugging, and iteration.
Development & Tools
71.5K

Autoember
AutoEmber is a visual design tool for building websites. It enables design, customization, and deployment (without coding) all in one place. AutoEmber empowers millions to create software through an intuitive visual approach, ushering in a new era of programming. Founded by Ben Shumaker and Izak Fritz, AutoEmber is part of the Y Combinator Summer 23 Batch.
AI design tools
42.5K

Usethisprompt
Softr is a visual website builder that helps users quickly build their own websites. It features a simple and user-friendly interface and a rich template library. Users can choose suitable templates based on their needs and customize them through a drag-and-drop editor. Softr also provides a variety of functions, including forms, database integration, user authentication, and responsive design to adapt to different screen sizes. Users can create various types of websites, such as personal blogs, e-commerce websites, and corporate websites, using Softr. The pricing is flexible and diverse, suitable for users of different scales.
Development & Tools
43.3K

AI Bot Builder
AI Bot is a visual, low-code platform that helps you quickly build and customize powerful AI robots. You can use it to build robots based on images, voice, and text, integrate various services, and easily deploy them to Google Cloud. AI Bot supports multiple channels like WhatsApp, Twitter, SMS, Telegram, with flexible expansion and reliable security.
Development and Tools
237.4K
Featured AI Tools

Flow AI
Flow is an AI-driven movie-making tool designed for creators, utilizing Google DeepMind's advanced models to allow users to easily create excellent movie clips, scenes, and stories. The tool provides a seamless creative experience, supporting user-defined assets or generating content within Flow. In terms of pricing, the Google AI Pro and Google AI Ultra plans offer different functionalities suitable for various user needs.
Video Production
43.1K

Nocode
NoCode is a platform that requires no programming experience, allowing users to quickly generate applications by describing their ideas in natural language, aiming to lower development barriers so more people can realize their ideas. The platform provides real-time previews and one-click deployment features, making it very suitable for non-technical users to turn their ideas into reality.
Development Platform
46.1K

Listenhub
ListenHub is a lightweight AI podcast generation tool that supports both Chinese and English. Based on cutting-edge AI technology, it can quickly generate podcast content of interest to users. Its main advantages include natural dialogue and ultra-realistic voice effects, allowing users to enjoy high-quality auditory experiences anytime and anywhere. ListenHub not only improves the speed of content generation but also offers compatibility with mobile devices, making it convenient for users to use in different settings. The product is positioned as an efficient information acquisition tool, suitable for the needs of a wide range of listeners.
AI
43.6K

Minimax Agent
MiniMax Agent is an intelligent AI companion that adopts the latest multimodal technology. The MCP multi-agent collaboration enables AI teams to efficiently solve complex problems. It provides features such as instant answers, visual analysis, and voice interaction, which can increase productivity by 10 times.
Multimodal technology
45.3K
Chinese Picks

Tencent Hunyuan Image 2.0
Tencent Hunyuan Image 2.0 is Tencent's latest released AI image generation model, significantly improving generation speed and image quality. With a super-high compression ratio codec and new diffusion architecture, image generation speed can reach milliseconds, avoiding the waiting time of traditional generation. At the same time, the model improves the realism and detail representation of images through the combination of reinforcement learning algorithms and human aesthetic knowledge, suitable for professional users such as designers and creators.
Image Generation
44.2K

Openmemory MCP
OpenMemory is an open-source personal memory layer that provides private, portable memory management for large language models (LLMs). It ensures users have full control over their data, maintaining its security when building AI applications. This project supports Docker, Python, and Node.js, making it suitable for developers seeking personalized AI experiences. OpenMemory is particularly suited for users who wish to use AI without revealing personal information.
open source
43.9K

Fastvlm
FastVLM is an efficient visual encoding model designed specifically for visual language models. It uses the innovative FastViTHD hybrid visual encoder to reduce the time required for encoding high-resolution images and the number of output tokens, resulting in excellent performance in both speed and accuracy. FastVLM is primarily positioned to provide developers with powerful visual language processing capabilities, applicable to various scenarios, particularly performing excellently on mobile devices that require rapid response.
Image Processing
42.0K
Chinese Picks

Liblibai
LiblibAI is a leading Chinese AI creative platform offering powerful AI creative tools to help creators bring their imagination to life. The platform provides a vast library of free AI creative models, allowing users to search and utilize these models for image, text, and audio creations. Users can also train their own AI models on the platform. Focused on the diverse needs of creators, LiblibAI is committed to creating inclusive conditions and serving the creative industry, ensuring that everyone can enjoy the joy of creation.
AI Model
6.9M