Open to Applied AI Engineer roles

Kent · Suhail Shah

I build production
AI systems.

I'm Kent (Suhail Shah), an applied AI engineer specializing in LLM pipelines. I build retrieval, agent orchestration, evals, and the backends that keep them reliable in production.

Get in touchResume ↓
Full-Stack + AI
Live Product
Open Source Work
Featured Project
Live · Streaming CLI

fourpoket

An AI coding agent built on multi-provider orchestration with retries and fallback, AST-aware code operations, and per-call cost tracking.

Built withTypeScript · Node.js · Next.js · Stripe
fourpoket.com ↗npm: fourpoket ↗Full deep dive ↓

Experience

Production systems designed and built inside companies, with a focus on applied AI.

Applied AI / LLM systems
Insurance EnterpriseInsurance Claims OperationsApr 2024 - Nov 2025

Claims Operations AI Assistant

Senior Applied AI Engineer

An internal AI knowledge and drafting platform built for insurance claims operations. The system enabled claims staff to retrieve policy information, understand procedures, locate product details, and draft internal communications through a citation-grounded retrieval architecture combining hybrid search, reranking, structured lookups, and evaluation-driven development.

Enterprise RAGHybrid retrieval + rerankingAI platform engineeringKubernetes operationsEvaluation frameworkCompliance & observability
Core stackNext.js · TypeScript · Python · FastAPI · PostgreSQL · Azure · Docker
Outcomes
Reduced policy and procedure lookup from minutes to a single conversational interaction
Delivered grounded responses with source citations, improving user trust and auditability
Improved retrieval quality through hybrid search, reranking, and structured lookups
Established evaluation suite questions for regression testing and model comparisons
Implemented production-grade observability, tracing, cost monitoring, and compliance controls
Built multi-tenant architecture with database-level isolation using PostgreSQL Row-Level Security
FreelanceEnterprise Knowledge ManagementApr 2023 - Mar 2024

AI Knowledge Platform

Senior Applied AI Engineer

Joined as the first engineer focused on AI products and led the development of a multi-tenant knowledge platform that transformed internal documentation into a citation-grounded conversational search experience. Over twelve months the platform evolved from a simple vector-search prototype into a production knowledge system featuring hybrid retrieval, reranking, integrations, evaluation frameworks, observability, analytics, and knowledge-health tooling.

Enterprise RAGKnowledge managementHybrid retrievalEvaluation frameworksMulti-tenant SaaSObservability
Core stackNext.js · TypeScript · Python · FastAPI · PostgreSQL · pgvector
Outcomes
Built the company's first AI platform from prototype to production over a twelve-month period
Converted early design partners into paying customers through continuous retrieval-quality improvements
Improved retrieval precision from roughly 60% to low/mid 80% range in internal evaluations
Established evaluation-driven development with benchmark suites used for all major retrieval and model changes
Reduced AI operating costs through caching, model routing, and token optimization strategies
Delivered integrations across Notion, Google Drive, Confluence, websites, and uploaded document repositories
Full-stack foundation
AccentureLogisticsJan 2022 - Mar 2023

Document Intelligence Platform

Senior Software Engineer

A document-intelligence product taken from early stage to production, on a two-service architecture: a Node and TypeScript product layer alongside a Python OCR and NLP extraction service, joined by a typed contract.

Multi-tenant SaaSTwo-service architectureKeyboard-first review UXTyped service contracts
Core stackNext.js · Node.js · TypeScript · Python · FastAPI · PostgreSQL
AccentureCorporate servicesNov 2020 - Dec 2021

Corporate Workflow & Approval System

Senior Software Engineer

A travel-and-expense workflow system rebuilt and shipped to production. The core design is a composable architecture where workflow templates and form templates are independent reusable building blocks configured as data, so new request types ship as configuration rather than code.

Team leadComposable workflow engineState machineAudit trail
Core stackReact · TypeScript · Node.js · PostgreSQL · Redis · SAML SSO
SunwayEvents securityNov 2019 - Oct 2020

Event Security Management System

Software Engineer

The backend API and web operations dashboard for an outdoor-events security operation, with API contracts coordinated with a separate mobile team. Replaced spreadsheet scheduling and paper attendance with a constraint-based scheduling engine, QR check-in with GPS verification, and a real-time operations dashboard.

Constraint-based schedulingReal-time dashboardSpatial queries (PostGIS)Cross-team API contracts
Core stackNode.js · TypeScript · PostgreSQL · PostGIS · Socket.IO · Redis

fourpoket

A complete look at the architecture, engineering decisions, and capabilities behind the product.

fourpoket.com npm install

Architecture

Four repositories, three deployment targets, one cohesive product.

four-poket-backend
Express API · AI orchestration · SQLite · multi-tenant
Railway
four-poket-cli
Ink terminal UI · AST parsing · code read/write engine
npm
four-poket-web
Next.js 16 · Auth · Dashboard · Stripe billing
Vercel
four-poket-admin
React + Vite · Session explorer · Revenue overview
Local
Domain via Cloudflare · Monitoring via Sentry · Emails via Resend

Tech Stack

Backend
TypeScriptNode.jsExpressSQLiteLevelDBZod
Frontend
ReactNext.js 16Tailwind CSS
CLI
Commander.jsInk (React)Web WASM
Auth & Payments
JWTGoogle OAuthStripebcrypt
AI Providers
DeepSeekMiniMaxOpenRouter
DevOps & Infra
RailwayVercelCloudflareDockerSentryResend
Admin Dashboard
ReactViteReact QueryRechartsReact Router
Testing
JestUnit TestsIntegration Tests

Engineering Highlights

The key technical achievements. Click any card to expand.

Open Source

Personal AI systems work, built in public with readable source.

Personal projectAI toolingGitHub ↗2026

learning-system

Public source, actively developed

A local learning platform where an LLM acts as the teacher and a typed Python backend owns memory, orchestration, and context engineering. The public source spans the applied AI stack: provider abstraction over two model transports, pgvector retrieval, a versioned eval framework with regression reporting, OpenTelemetry tracing, and an approval-gated agent layer.

Agent orchestrationEval frameworkLLM observabilitypgvector retrievalTool callingHuman-in-the-loop
Core stackPython · FastAPI · Pydantic · PostgreSQL · pgvector · OpenTelemetry · pytest
Outcomes
Public commit history with signed commits and pre-commit gates for lint, strict typing, and commit format
Versioned eval sets run against either transport with deterministic and LLM-as-judge scoring
A regression report diffs each eval run against the last, per set and per item
Every LLM round trip is recorded with latency and cost fields and linked to traces and error logs
Agent mutations apply atomically behind a human approval gate, and failures survive rollback
Smoke scripts verify transport contracts against live providers, separate from the unit suite

About

Hey, I'm Kent. I build the layer between raw model capability and software people can actually trust.

Here's the bet I'm making with my career. Every powerful tool has always charged an entry fee. Blender, Photoshop, a serious spreadsheet, each one takes months before it gives anything back. AI collapses that fee into a conversation. You say what you want and the tool meets you there. Wrapping hard software in plain language is the biggest shift in how people use computers since the GUI, and it gets won or lost at the application layer. That's where I work.

A model on its own is an engine on a stand. Loud, impressive, going nowhere. I build the car around it: retrieval and orchestration as the drivetrain, evals as the brakes, observability as the dashboard. I work the whole machine because the interesting failures hide between the parts. A retrieval bug can look like a prompt problem, and a cost spike can really be a chunking decision. Raw capability is getting cheap. Proof that it works is the hard part, and that's the part I love.

The name “Kent” is a personal nickname derived from the meaning behind my given name, Suhail. In Arabic, Suhail is associated with Canopus, one of the brightest stars in the night sky and a historic navigational star known for guidance and direction. Inspired by that connection, I adopted Kent as the name I use professionally and in everyday life.

Whether you know me as Suhail or Kent, they're both part of the same story—and both refer to the same person.

Applied AI
LLM pipelines, retrieval, agents, and evals.
Full-Stack
Backend, frontend, infra, and everything from embeddings to evals in between.
Live Product
A production AI tool, live and taking real payments.
End-to-End Ownership
From model behavior to billing webhooks, the whole system.
What I Bring
LLM OrchestrationRAG & Hybrid RetrievalAgent SystemsLLM EvaluationStructured OutputsStreamingHuman-in-the-Loop DesignContext EngineeringPrompt EngineeringCost & Latency Optimization

Get in Touch

Interested in working together? Let's connect.

Open to full-time Applied AI Engineer roles · Remote

hello@kentdoki.devGitHubLinkedInfourpoket.comnpm: fourpoket