ContextCheck: LLM & RAG Evaluation Framework

Published at: 11/27/2024
Categories: aiops
Author: edwin_lisowski

Hi all! We have open-sourced ContextCheck, a framework for testing LLMs, RAG systems, and chatbots. The tool automates query generation, completion requests, regression detection, penetration testing, and hallucination assessment, and it is designed for developers, researchers, and businesses. We are also looking for contributors, so feel free to try it out and share your feedback!

Repo on GitHub
