Opik by Comet

Opik by Comet

by Comet ML

Open-source LLM evaluation and testing platform by the creators of Comet ML

Open Source Machine learning API Web API Python Self-hosted
Visit Product
291 upvotes 795 views

About

Opik is an open-source LLM evaluation, testing, and monitoring platform developed by Comet ML — a company with a decade of experience in ML experiment tracking. Built specifically for the challenges of generative AI, Opik provides the tooling needed to systematically evaluate LLM application quality, catch regressions before they reach production, and monitor deployed applications over time.

The platform enables developers to log every LLM call with full metadata, create curated datasets for regression testing, and run evaluations using built-in or custom metrics including answer relevance, hallucination detection, context precision, and more. Its CI/CD integration means quality checks can be automated as part of the deployment pipeline.

Opik's connection to the broader Comet ML ecosystem gives it unique strength in experiment tracking — teams can compare not just prompts but entire application configurations, model versions, and retrieval parameters to understand what changes actually improve quality. This scientific approach to LLM development distinguishes teams that ship reliable AI from those that iterate blindly.

Product Features

- LLM tracing and logging for all providers
- Evaluation with 20+ built-in metrics (relevance, faithfulness, etc.)
- Dataset management for regression testing
- CI/CD integration for automated quality gates
- Prompt versioning and comparison
- Online monitoring with production traffic analysis
- Experiment tracking integrated with Comet ML
- Hallucination and toxicity detection
- Self-hosted deployment option
- SDKs for Python and TypeScript

About the Publisher

Comet ML was founded in 2017 by Gideon Mendels and Boris Dayma with the mission of helping ML teams build better models faster through experiment tracking and collaboration. The company has raised over $50 million and serves ML teams at Fortune 500 companies. Opik represents Comet's expansion into the LLM operations space, bringing the same systematic, experiment-driven approach to generative AI that made Comet ML an industry standard for traditional ML development.