TruLens
TruLens is an open-source library for evaluating and tracking LLM-based applications, with a focus on RAG pipelines. It provides feedback functions for groundedness, answer relevance, and context relevance, plus a dashboard for visualizing eval results across experiments.
DeepEval
Open-source evaluation framework with 14+ metrics including faithfulness, relevancy, and hallucination detection. Integrates with CI/CD.