James Hendrie

Product + Systems Leadership
Back Certification

Gauntlet AI

Applied AI Engineering

A 200-hour intensive program focused on building and deploying production-grade AI systems end to end.

Team

3-Person Squad

Capstone

Healthcare AI Agent

Finale

Demo Day Presenter

Gauntlet AI

Program

Applied AI Engineering

Specialty

AI Engineering

Hours Awarded

200

Format

7wk Remote + 2wk On-Site

Completed

February 2026

Issued

March 30, 2026

Certificate ID

CERT-2026-9A2D62F9327DE81A

Verify

About the Program

The full AI engineering stack from end to end — RAG pipelines, autonomous agents, evaluation frameworks, and production deployment.

The curriculum covers seven weeks of hands-on modules: building and evaluating RAG pipelines, designing autonomous agents with LangChain and LangGraph, implementing vector databases and Graph RAG, building SQL agents with MCP integrations and memory systems, and working with modern patterns including evals, fine tuning, and spec-driven development. Every module ships working code — not slides.

The program culminates in a two-week on-site intensive where teams build and deploy a functional AI system for a capstone project focused on real business ROI, followed by a Demo Day presentation.

Requirements

  • Complete all module assignments and evaluations across RAG, agents, and evals
  • Build and deploy a functional AI system for the capstone project
  • Demonstrate proficiency with LangChain, LangGraph, LangSmith, Claude Code, Cursor, and vector databases
  • Meet defined performance thresholds on evaluations
  • Participate in spec-driven development and AI-first coding exercises
  • Complete the on-site intensive and present capstone at Demo Day

My Experience

I served as both product manager and engineer on a three-person team — bridging product thinking with hands-on technical execution.

On the product side, I owned the roadmap, requirements, and cross-functional delivery. On the engineering side, I designed the multi-agent architecture for our capstone — a customer-facing AI system that turns natural-language questions into validated SQL queries against a healthcare SaaS database — and built the end-to-end evaluation harness covering intent classification, SQL correctness, security boundary enforcement, and response quality.

The biggest shift for me was evaluation rigor. Learning to build systematic evals — not just vibes-checking outputs — changed how I think about shipping AI. I'm now applying that same discipline to AI-assisted design workflows and AI product features in my current role.

My Role

Product manager and engineer — owned roadmap, multi-agent architecture, and the full evaluation harness.

Capstone

A production AI agent that answers data questions in real time for healthcare SaaS — natural language in, validated SQL out.

Biggest Takeaway

Systematic evaluation rigor — knowing how to build evals that catch what vibes-checking misses, before it ships.

Skills & Technologies

RAG Pipelines Evaluation Frameworks & Metrics Autonomous Agents Multi-Agent Architecture LangChain LangGraph LangSmith Claude Code Cursor Vector Databases Graph RAG SQL Agents MCP Integrations Memory Systems Fine Tuning Spec-Driven Development AI-First Coding Workflows Retrieval Systems & Hybrid Search Prompt Engineering & Decomposition Observability & Monitoring API Design & Backend Deployment Cloud Deployment Frontend Integration for AI Systems

Ask me anything

Honest answers, not a sales pitch.

What would you like to know?

Ask about James's experience, skills, approach, or anything on the site.