AIchitect
207 tools · 25 stacks

AI tools are all over the place. This is the full landscape: 207 tools across 17 categories, mapped and connected.

These tools compete with each other
TruLens
vs
DeepEval

Choose TruLens when…

  • You are evaluating RAG pipeline quality (groundedness and relevance)
  • You want open-source evals with a visual results dashboard
  • You are building with LangChain or LlamaIndex and need eval integration

Choose DeepEval when…

  • You want a pytest-style framework for LLM testing
  • Unit-test-like evals for LLM outputs fit your workflow
  • You need RAG-specific metrics like faithfulness and relevancy

Field        TruLens              DeepEval
Category     Prompt & Eval        Prompt & Eval
Type         OSS                  OSS
Free Tier    ✓ Yes                ✓ Yes
Plans        Open Source: Free    -
Stars        ⭐ 2,100              ⭐ 5,500
Health       -                    80 (Active)
Trajectory   not enough data      not enough data
Synced       -                    today

TruLens

TruLens is an open-source library for evaluating and tracking LLM-based applications, with a focus on RAG pipelines. It provides feedback functions for groundedness, answer relevance, and context relevance, plus a dashboard for visualizing eval results across experiments.
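
To make the three feedback dimensions concrete, here is a minimal sketch of what scoring a RAG record on groundedness, answer relevance, and context relevance could look like. It does not use the TruLens API: `RagRecord`, `evaluate`, and the word-overlap scoring are hypothetical stand-ins, and real feedback functions typically rely on an LLM or embedding model rather than token overlap.

```python
import re
from dataclasses import dataclass


@dataclass
class RagRecord:
    """One RAG interaction: the user question, retrieved context, and answer."""
    question: str
    context: str
    answer: str


def _tokens(text: str) -> set:
    # Lowercased alphanumeric tokens; a crude stand-in for semantic matching.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def _overlap(source: str, target: str) -> float:
    """Fraction of the target's tokens that also appear in the source."""
    target_tokens = _tokens(target)
    if not target_tokens:
        return 0.0
    return len(target_tokens & _tokens(source)) / len(target_tokens)


def groundedness(r: RagRecord) -> float:
    # Is the answer supported by the retrieved context?
    return _overlap(r.context, r.answer)


def answer_relevance(r: RagRecord) -> float:
    # Does the answer address the question?
    return _overlap(r.answer, r.question)


def context_relevance(r: RagRecord) -> float:
    # Is the retrieved context about the question?
    return _overlap(r.context, r.question)


def evaluate(r: RagRecord) -> dict:
    """Run every toy feedback function and collect scores in [0, 1]."""
    feedbacks = (groundedness, answer_relevance, context_relevance)
    return {fn.__name__: fn(r) for fn in feedbacks}
```

Calling `evaluate` on each logged record yields one score per dimension, which is the kind of per-experiment data a results dashboard would then aggregate.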

DeepEval

DeepEval is an open-source evaluation framework with 14+ metrics, including faithfulness, relevancy, and hallucination detection. It integrates with CI/CD pipelines.
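
As an illustration of the pytest-style workflow, the sketch below shows the general shape of a unit-test-like eval: compute a metric on an LLM output and assert that it clears a threshold. It deliberately avoids the DeepEval API. `toy_faithfulness`, `fake_llm_app`, and `FAITHFULNESS_THRESHOLD` are hypothetical names, and the word-overlap metric is only a crude proxy for a real faithfulness score.

```python
import re

# Hypothetical pass mark; a real project would tune this per metric.
FAITHFULNESS_THRESHOLD = 0.7


def toy_faithfulness(answer: str, context: str) -> float:
    """Share of answer words that also occur in the retrieved context."""
    words = re.findall(r"[a-z0-9]+", answer.lower())
    if not words:
        return 0.0
    context_words = set(re.findall(r"[a-z0-9]+", context.lower()))
    return sum(w in context_words for w in words) / len(words)


def fake_llm_app(question: str, context: str) -> str:
    """Stand-in for the application under test; a real test would call your app."""
    return "Refunds are accepted within 30 days."


def test_refund_answer_is_faithful():
    # pytest collects any function named test_*; a failed assert fails the run.
    context = "Refunds are accepted within 30 days of purchase."
    answer = fake_llm_app("What is the refund policy?", context)
    score = toy_faithfulness(answer, context)
    assert score >= FAITHFULNESS_THRESHOLD, f"faithfulness {score:.2f} is below threshold"
```

Running `pytest` on a file like this in a CI job returns a non-zero exit code when an assertion fails, which is what lets an eval regression block a build.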

Shared Connections (1)

RAGAS

Only TruLens (1)

DeepEval

Only DeepEval (6)

Langfuse, PromptFoo, OpenAI API, TruLens, Inspect, Galileo
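
The shared and exclusive groups above amount to a set intersection and two set differences. The sketch below reproduces that split, assuming each tool's full connection list is simply the union of its shared and exclusive entries as listed here; `split_connections` is a hypothetical helper, not part of the site.

```python
# Connection sets reconstructed from the lists above (an assumption:
# shared + exclusive entries together form each tool's full list).
trulens_connections = {"RAGAS", "DeepEval"}
deepeval_connections = {
    "RAGAS", "Langfuse", "PromptFoo", "OpenAI API",
    "TruLens", "Inspect", "Galileo",
}


def split_connections(a: set, b: set) -> dict:
    """Partition two connection sets into shared and exclusive members."""
    return {
        "shared": a & b,   # connected to both tools
        "only_a": a - b,   # connected to the first tool only
        "only_b": b - a,   # connected to the second tool only
    }


result = split_connections(trulens_connections, deepeval_connections)
```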