Bookmark: MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications

lqdev☔12/11/2023

https://www.databricks.com/blog/announcing-mlflow-28-llm-judge-metrics-and-best-practices-llm-evaluation-rag-applications-part?utm_source=twitter&utm_medium=organic-social

LLM-as-a-judge is one promising tool in the suite of evaluation techniques necessary to measure the efficacy of LLM-based applications.

Permalink: https://www.luisquintanilla.me/feed/mlflow-2-8-llm-as-judge-rag-evaluation/

Tags: #mlflow #ai #llm #evaluation
