Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
evaluation
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Evals Aren’t a One-Time Report: Build a Living Test Suite That Ships With Every Release.
Lamhot Siagian
Lamhot Siagian
Lamhot Siagian
Follow
Feb 22
Evals Aren’t a One-Time Report: Build a Living Test Suite That Ships With Every Release.
#
llm
#
ai
#
evaluation
1
 reaction
Comments
Add Comment
6 min read
If you don't red-team your LLM app, your users will
Lamhot Siagian
Lamhot Siagian
Lamhot Siagian
Follow
Feb 22
If you don't red-team your LLM app, your users will
#
ai
#
llm
#
evaluation
#
security
1
 reaction
Comments
Add Comment
7 min read
Go Ahead and Judge Me- Agent Evaluators in AWS AgentCore
mgbec
mgbec
mgbec
Follow
for
AWS Community Builders
Jan 25
Go Ahead and Judge Me- Agent Evaluators in AWS AgentCore
#
evaluation
#
agents
#
amazonbedrock
Comments
Add Comment
6 min read
Why Image Hallucination Is More Dangerous Than Text Hallucination
Priyam
Priyam
Priyam
Follow
Jan 6
Why Image Hallucination Is More Dangerous Than Text Hallucination
#
evaluation
#
ai
#
machinelearning
#
futureagi
Comments
Add Comment
1 min read
The Self-Evolving Agent (Part 3): The Human in the Loop
Imran Siddique
Imran Siddique
Imran Siddique
Follow
Jan 1
The Self-Evolving Agent (Part 3): The Human in the Loop
#
architecture
#
aigovernance
#
evaluation
#
engineeringleadershi
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account