LLM

4 articles in this category

How We Know an AI Agent Is Actually Good: Eval Harnesses and LLM-as-Judge
Jun 3, 2026

The difference between an agent that demos well and one you can put in front of customers is measurement. Here's how we score AI agent quality — eval harnesses, LLM-as-judge, and regression tests.

6 min read
Experimenting with the New & Powerful Anthropic Claude API
Mar 3, 2024

3 min read
Using CrewAI agents to plan my next trip.
Feb 8, 2024

4 min read
Building an AI Document Chatbot Using Flowise, ChatGPT OpenAI & Pinecone
Jul 4, 2023

2 min read
