Skip to content

DEV Community

# evaluation

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Feb 22

Evals Aren’t a One-Time Report: Build a Living Test Suite That Ships With Every Release.

#llm #ai #evaluation

6 min read

Feb 22

If you don't red-team your LLM app, your users will

#ai #llm #evaluation #security

7 min read

mgbec for AWS Community Builders

Jan 25

Go Ahead and Judge Me- Agent Evaluators in AWS AgentCore

#evaluation #agents #amazonbedrock

6 min read

Priyam

Jan 6

Why Image Hallucination Is More Dangerous Than Text Hallucination

#evaluation #ai #machinelearning #futureagi

1 min read

Jan 1

The Self-Evolving Agent (Part 3): The Human in the Loop

#architecture #aigovernance #evaluation #engineeringleadershi

4 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.