DEV Community

# llm

Posts

Using a Single GPT Client as a Language Runtime (No API, No Agents)

Comments
2 min read
RAG Evaluation Metrics: Measuring What Actually Matters

Comments
10 min read
Beyond input(): Building Production-Ready Human-in-the-Loop AI Agents with LangGraph

1
Comments
6 min read
SpecMD — What if Your Documentation Was Your Code?

Comments
2 min read
Key Skills You Must Master to Succeed in LLM Interviews

Comments
3 min read
🧰 I Built LLMKit: A Complete Toolkit for Testing LLM APIs

Comments
2 min read
AI without the hype: using LLMs to reduce noise, not replace thinking

Comments
4 min read
Cutting LLM Expenses and Response Times by 70% Through Bifrost's Semantic Caching

5
Comments
6 min read
LLMs Unleashed: Powering the Future 🚀

Comments
2 min read
Building Bulletproof AI Apps: Multi-Provider Failover with Bifrost

5
Comments
7 min read
🛠 Local LLM Ops 2025: A Developer's Guide to Running Pocket-Sized Neural Networks

Comments
2 min read
RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

Comments
7 min read
Let’s talk about: Goose!

Comments
15 min read
The Orphan Axiom Problem in Ontology-Based RAG

Comments
6 min read
⚙️ One Tool, Many Brains: Building a Multi-Model DevOps Architect

Comments
7 min read
How to Implement Observability for AI Agents with LangGraph, OpenAI Agents, and Crew AI

Comments
6 min read
Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Comments
12 min read
How to Build Multi-Provider Failover Strategies with Bifrost for Ultra‑Reliable AI Applications

5
Comments
8 min read
Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Comments
7 min read
Dec 19, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

Comments
5 min read
📌 Most models use Grouped Query Attention. That doesn’t mean yours should.📌

1
Comments
1 min read
How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

Comments
18 min read
Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Comments
5 min read
How to Use Synthetic Data to Evaluate LLM Prompts: A Step-by-Step Guide

Comments
8 min read
I Didn’t Build a Chatbot — I Built an AI That Runs the System

Comments
2 min read