Gauntlet AI
A 200-hour intensive program focused on building and deploying production-grade AI systems end to end.
Team
3-Person Squad
Capstone
Healthcare AI Agent
Finale
Demo Day Presenter
Program
Applied AI Engineering
Specialty
AI Engineering
Hours Awarded
200
Format
7wk Remote + 2wk On-Site
Completed
February 2026
Issued
March 30, 2026
Certificate ID
CERT-2026-9A2D62F9327DE81A
About the Program
The full AI engineering stack from end to end — RAG pipelines, autonomous agents, evaluation frameworks, and production deployment.
The curriculum covers seven weeks of hands-on modules: building and evaluating RAG pipelines, designing autonomous agents with LangChain and LangGraph, implementing vector databases and Graph RAG, building SQL agents with MCP integrations and memory systems, and working with modern patterns including evals, fine tuning, and spec-driven development. Every module ships working code — not slides.
The program culminates in a two-week on-site intensive where teams build and deploy a functional AI system for a capstone project focused on real business ROI, followed by a Demo Day presentation.
Requirements
My Experience
I served as both product manager and engineer on a three-person team — bridging product thinking with hands-on technical execution.
On the product side, I owned the roadmap, requirements, and cross-functional delivery. On the engineering side, I designed the multi-agent architecture for our capstone — a customer-facing AI system that turns natural-language questions into validated SQL queries against a healthcare SaaS database — and built the end-to-end evaluation harness covering intent classification, SQL correctness, security boundary enforcement, and response quality.
The biggest shift for me was evaluation rigor. Learning to build systematic evals — not just vibes-checking outputs — changed how I think about shipping AI. I'm now applying that same discipline to AI-assisted design workflows and AI product features in my current role.
My Role
Product manager and engineer — owned roadmap, multi-agent architecture, and the full evaluation harness.
Capstone
A production AI agent that answers data questions in real time for healthcare SaaS — natural language in, validated SQL out.
Biggest Takeaway
Systematic evaluation rigor — knowing how to build evals that catch what vibes-checking misses, before it ships.
Skills & Technologies
Ask me anything
Honest answers, not a sales pitch.
Ask about James's experience, skills, approach, or anything on the site.