The Traceloop blog
is under construction 🏗️

Coming soon...

The Traceloop Blog

GRUEN's Outstanding Performance in LLM Quality Evaluation

We present GRUEN, a great and helpful metric for evaluating text quality for grammatical correctness, redundancy and focus

Read more →

DIY observability for LLMs with OpenTelemetry

Use OpenTelemetry to get LLM observability without adopting any new tools

Read more →

Demystifying the BLEU Metric: A Comprehensive Guide to Machine Translation Evaluation

This blog post explores the usage of the BLEU metric for quality assessment of text translation generative AI tasks

Read more →

Evaluating Model Performance with the ROUGE Metric: A Comprehensive Guide

This blog post explores the usage of the rouge metric for quality assessment of text summarization generative AI tasks

Read more →

Introducing OpenLLMetry — Extending OpenTelemetry to LLMs

Introducing OpenLLMetry - a set of extensions on top of OpenTelemetry that provide observability for LLM applications

Read more →