AIchitect
207 tools · 25 stacks

AI tools are all over the place. This is the full landscape: 207 tools across 17 categories, mapped and connected.

Stack Layers
What are you building and how is it defined?
How do you write and ship code?
How does your AI think and act?
Which models and infrastructure power it?
How do you build, observe, and extend it?
These tools compete with each other
PromptFoo vs Vellum

Choose PromptFoo when…

  • You want CLI-first, config-driven LLM evals
  • You want to run eval suites in CI/CD pipelines
  • You need red-teaming and safety testing built in

Choose Vellum when…

  • You want a full LLM product development platform
  • You want prompt management, testing, and deployment in one place
  • You're iterating on prompts in a team workflow
Field        PromptFoo          Vellum
Category     Prompt & Eval      Prompt & Eval
Type         OSS                SaaS
Free Tier    ✓ Yes              ✓ Yes
Plans        -                  Starter: Paid
Stars        ⭐ 5,000            -
Health       ● 80 (Active)      -
Trajectory   not enough data    not enough data
Synced       8 days ago         -

PromptFoo

Test and compare prompts across models. Built-in red-teaming, regression testing, and side-by-side model comparison.

Vellum

End-to-end platform for prompt engineering teams. Version prompts, run A/B tests, evaluate quality, and deploy to production with a visual interface.


Only PromptFoo (6)

Langfuse, DeepEval, OpenAI API, Vellum, Agenta, Galileo

Only Vellum (2)

PromptFoo, Humanloop