OpenGVLab

vision-language-models | multimodal-evaluation

Overview

OpenGVLab is the organization behind InternVL, InternLM-V, and M3I-Eval benchmarks. It provides a suite of open-source vision-language models and evaluation tools.

Key Projects

InternVL: Large-scale vision-language foundation model
InternLM-V: Vision-language variant of InternLM
M3I-Eval: Multi-modal multi-choice evaluation benchmark

Relationship to Other Projects

Related to OpenGVLab (self-reference via the organization)
Competes with OpenAI CLIP and LLaVA in the VLM space

References

GitHub: https://github.com/opengvlab