AIchitect
207 tools · 25 stacks

AI tools are all over the place. This is the full landscape — 207 tools across 17 categories, mapped and connected. Ready to narrow it down? Build your stack →


Stack Layers

  • What are you building and how is it defined?
  • How do you write and ship code?
  • How does your AI think and act?
  • Which models and infrastructure power it?
  • How do you build, observe, and extend it?
LLaVA vs InternVL2

Choose LLaVA when…

  • You want an open-source multimodal model for self-hosted deployment
  • You're doing research on vision-language instruction following
  • You need a well-documented baseline for multimodal tasks

Choose InternVL2 when…

  • You want the highest benchmark scores among open-source vision models
  • You need multi-image and high-resolution document understanding
  • You're comparing models and want the strongest open-weight option
Field         LLaVA ⚠               InternVL2
Category      Multimodal            Multimodal
Type          OSS                   OSS
Free Tier     ✓ Yes                 ✓ Yes
Plans         —                     —
Stars         ⭐ 22,000             ⭐ 7,800
Health        ● 40 — Slowing        —
Trajectory    — not enough data     — not enough data
Synced        7 days ago            —

LLaVA

Large Language and Vision Assistant — connects a vision encoder to an LLM for instruction-following with images. OSS research model widely used as a multimodal base. Runs via Ollama.
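For a concrete sense of what "runs via Ollama" means, here is a minimal sketch of querying a locally pulled LLaVA model through Ollama's HTTP API. It assumes the Ollama server is running on its default port (11434), that "ollama pull llava" has already been run, and that photo.jpg is a placeholder image path.

# Minimal sketch: ask a local LLaVA model about an image via Ollama's REST API.
# Assumes the Ollama server is running on localhost:11434 and the llava model
# has been pulled; photo.jpg is a placeholder path.
import base64
import requests

with open("photo.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llava",
        "prompt": "Describe what is in this image.",
        "images": [image_b64],  # multimodal models accept base64-encoded images
        "stream": False,        # return one JSON object instead of a token stream
    },
)
print(resp.json()["response"])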

InternVL2

InternVL2 series from Shanghai AI Lab — consistently top-ranked on open-source multimodal benchmarks. Strong at document understanding, chart analysis, and multi-image reasoning.
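As a rough sketch of how InternVL2 is typically loaded, the snippet below uses Hugging Face transformers with trust_remote_code, which exposes the model's custom chat() method. The OpenGVLab/InternVL2-8B checkpoint, the single 448x448 tile, and the ImageNet normalization are assumptions made to keep the example short; the official model card uses a dynamic multi-tile image loader instead.

# Minimal sketch: one-image question answering with InternVL2 via transformers.
# Assumes the OpenGVLab/InternVL2-8B checkpoint, a CUDA GPU with enough memory,
# and a simplified single-tile 448x448 preprocessing; chart.png is a placeholder.
import torch
import torchvision.transforms as T
from PIL import Image
from transformers import AutoModel, AutoTokenizer

path = "OpenGVLab/InternVL2-8B"
model = AutoModel.from_pretrained(
    path, torch_dtype=torch.bfloat16, trust_remote_code=True
).eval().cuda()
tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True, use_fast=False)

# One 448x448 tile with ImageNet normalization -> shape (1, 3, 448, 448).
preprocess = T.Compose([
    T.Resize((448, 448)),
    T.ToTensor(),
    T.Normalize(mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225)),
])
pixel_values = preprocess(Image.open("chart.png").convert("RGB")).unsqueeze(0)
pixel_values = pixel_values.to(torch.bfloat16).cuda()

question = "<image>\nSummarize this chart in one sentence."
response = model.chat(tokenizer, pixel_values, question,
                      dict(max_new_tokens=256, do_sample=False))
print(response)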

LLaVA: Website ↗ · GitHub ↗
InternVL2: Website ↗ · GitHub ↗

Only LLaVA (3)

Moondream · InternVL2 · Ollama

Only InternVL2 (3)

LLaVA · Qwen-VL · vLLM
See full comparison in Explore →