Lessons Learned from Using LLMs to Evaluate LLMs
February 2024
When and how to use LLMs as a tool to evaluate and monitor the quality of other LLMs
Read more →February 2024
When and how to use LLMs as a tool to evaluate and monitor the quality of other LLMs
Read more →We present GRUEN, a great and helpful metric for evaluating text quality for grammatical correctness, redundancy and focus
Read more →Use OpenTelemetry to get LLM observability without adopting any new tools
Read more →This blog post explores the usage of the BLEU metric for quality assessment of text translation generative AI tasks
Read more →This blog post explores the usage of the rouge metric for quality assessment of text summarization generative AI tasks
Read more →Introducing OpenLLMetry - a set of extensions on top of OpenTelemetry that provide observability for LLM applications
Read more →