AI ENGINEER
2025
← HOME

Working Philosophy

The principles that guide my development process

01

AI-First Thinking

Start with the intelligence layer. Route tasks to the right model based on complexity, latency, and cost. Use local inference for sensitive data, cloud for scale.

02

Iterate in Production

Ship fast, measure everything. Real user feedback beats perfect planning. Deploy daily with automated rollback. Monitor latency, error rates, and user satisfaction in real-time.

03

Build to Scale

Start with serverless, add infrastructure when needed. Design for horizontal scaling from day one. Use managed services until you need control, then self-host strategically.

04

Automate Everything

If I do it twice, I automate it. Workflows for deployment, testing, monitoring, and data processing. Let AI handle the repetitive, keep humans in the loop for judgment.

Reasoning Engines

The LLMs I route to based on task requirements

COMPLEX REASONING
CLOUD

GPT-4o

OpenAI · 128K context · $5/1M tokens

My go-to for multi-step workflows, code generation, and strategic planning. Strong tool-use capabilities and reliable structured output.

When: Complex logic, code generation, multi-step tasks
DOCUMENT ANALYSIS
CLOUD

Claude 3.5 Sonnet

Anthropic · 200K context · $3/1M tokens

Best for long-context analysis, detailed reasoning, and instruction following. Superior safety and nuanced understanding of complex documents.

When: Large documents, careful analysis, compliance requirements
FAST TASKS
CLOUD

GPT-4o-mini

OpenAI · 128K context · $0.15/1M tokens

Cost-optimized for classification, summarization, and routine automation. 15x cheaper than GPT-4o with ~300ms latency.

When: High-volume tasks, simple classification, rapid prototyping
PRIVATE DATA
SELF-HOSTED

Llama 3.1 70B

Meta · Ollama Runtime · On-Premise

Local inference for sensitive data. Zero external API calls, complete control. Competitive quality with privacy guarantees.

When: PII processing, enterprise compliance, offline requirements
VOICE AI
HYBRID

ElevenLabs + Whisper

Conversational AI · <800ms latency

Real-time voice interactions with Whisper STT, LLM reasoning, and ElevenLabs TTS. WebRTC streaming for sub-second responses.

When: Customer service, voice assistants, accessibility features

Development Stack

Daily tools and frameworks I work with

TERMINAL
LOCAL

Warp Terminal

Rust-based · AI-native

GPU-accelerated terminal with AI command generation, workflow blocks, and collaborative sessions. My command center for everything.

When: Every day, all day — command execution, debugging, deployment
FRONTEND
FRAMEWORK

React + Vite

React 18 · Tailwind CSS

Modern React with Vite for instant HMR. Tailwind for rapid UI development. Component-driven architecture with TypeScript for type safety.

When: Dashboards, admin panels, customer-facing interfaces
BACKEND
SERVERLESS

Vercel Edge Functions

Node.js · Global CDN

Serverless functions at the edge. <50ms cold starts, automatic scaling, deployed across 30+ regions. Perfect for API endpoints.

When: API routes, webhooks, serverless backends
DATABASE
CLOUD

Supabase

PostgreSQL 15 · Real-time subscriptions

Managed Postgres with automatic API generation, row-level security, and real-time subscriptions. Edge functions for serverless compute.

When: Transactional data, user management, real-time features
VECTOR SEARCH
SELF-HOSTED

Weaviate

v1.25 · HNSW Index

Vector database for semantic search and RAG. Multi-modal embeddings, hybrid search, <100ms query latency at 2.8M+ vectors.

When: Semantic search, RAG applications, similarity matching
CONTAINERS
ORCHESTRATION

Docker Swarm

v24.x · 30+ services

Container orchestration for microservices. Service discovery, load balancing, rolling updates, and automatic failover across 6 nodes.

When: Microservices architecture, self-hosted infrastructure

Automation & Integration

How I connect systems and automate workflows

WORKFLOW ENGINE
SELF-HOSTED

n8n Enterprise

150+ workflows · 8.5K daily executions

Visual workflow automation with 400+ integrations. Custom nodes, error handling, webhook triggers. My automation command center.

When: Data pipelines, event-driven workflows, system integration
SaaS INTEGRATION
CLOUD

Zapier Premium

80+ Zaps · 50K monthly tasks

Enterprise SaaS integration for CRM, email, calendar, and business tools. Multi-step automation with conditional logic.

When: Connecting SaaS tools, marketing automation, business workflows
PROTOCOL BRIDGE
SELF-HOSTED

MCP Server

Model Context Protocol · 45 tools

Standardized protocol for LLM-to-system communication. JSON-RPC interface for tool calling and context management.

When: LLM tool integration, agent frameworks, system control

Quality Standards

The benchmarks I target for production systems

<200ms
API Latency
99.7%
Uptime Target
Daily
Deployments
100%
Test Coverage
<5min
Rollback Time
Auto
Scaling