
Sri

Director, Cloud & Data Engineering, with 24 years of enterprise experience. Building production AI systems that ship: multi-agent orchestration, memory infrastructure, and LLM observability.

8 Systems Built
10 AI Agents
15+ Services

Production Systems

Every project is production-hardened, running on real infrastructure, serving real users. Not demos. Not tutorials. Shipped products.

Lumen · live

Family AI operating system with 10 specialized agents, 4-layer memory architecture, and multi-channel communication (WhatsApp, Telegram, web). Local-first with Ollama, cloud fallback chain.

TypeScript Next.js PostgreSQL pgvector Neo4j Redis Ollama Cloudflare Tunnel
15 microservices
10 AI agents
4-layer memory stack
4 LLM providers

Smara · live · api

Memory-as-a-Service API for AI agents. Temporal Memory Scoring with Ebbinghaus decay curves. 3-call integration. MCP server on npm. Half the price of Mem0 Pro.

Fastify PostgreSQL pgvector Voyage AI Stripe Railway MCP
3 API calls
48 integration tests
$99 pro/mo

ARIES v2.0 · shipped · open source

AI-powered project estimation platform. Upload a BRD/SOW → 9 specialized AI agents run a visual drag-and-drop pipeline → complete project estimate in under 2 minutes. Built for enterprise innovation competitions.

React Vite React Flow FastAPI SQLite YAML Agents
9 AI agents
75 source files
37 API routes
5 workflow templates

Traces · open source · npm

Local-first LLM observability. PostgreSQL-only alternative to Langfuse (which needs ClickHouse + Redis + MinIO). Single service, ~200MB RAM, deploys in 30 seconds via npx traces-dev.

TypeScript Fastify Next.js Drizzle ORM PostgreSQL
npx deployable
200 MB RAM
2-line SDK integration

PJ Tales · live

AI-powered bedtime story production pipeline for children. Full screenplay format (Pixar/Ghibli quality), 14 TTS voices, 8-dimension quality scoring, visual drag-and-drop pipeline IDE, automated cron generation.

FastAPI React Flow Orpheus TTS Edge TTS Ollama FFmpeg
13 pipeline stages
14 TTS voices
8 quality dimensions
15 episodes

MCP Doctor · building

Postman meets Datadog for MCP servers. Compliance testing, continuous monitoring, tool-call debugging, latency tracking. 50+ compliance checks. SaaS model targeting indie devs and small teams.

Fastify TimescaleDB Next.js MCP Protocol Stripe
50+ compliance checks
95% margins

Architecture Decisions

Lumen's memory stack is the core IP. Four layers working together, each solving a different retrieval problem.

Layer 1 · Speed
Redis Ring Buffer
40-turn sliding window per user, cross-channel unified, compressed at 500-char boundary
Layer 2 · Relevance
Hybrid Search + RRF
BM25 + pgvector cosine → Reciprocal Rank Fusion → LLM re-ranking of top-8
Layer 3 · Relationships
Neo4j Bi-temporal Graph
Expire+create edges, full relationship history, temporal asOf queries
Layer 4 · Salience
Ebbinghaus Decay
Importance feedback loop (+0.05 per retrieval, capped 0.95). Memories that matter persist.
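
Two of the layers above are compact enough to sketch. What follows is an illustration, not Lumen's actual code: it shows Reciprocal Rank Fusion over two ranked ID lists (Layer 2) and an Ebbinghaus-style retention score with the importance feedback loop (Layer 4). The +0.05 step and 0.95 cap come from the description above. Everything else is an assumption made for the example: the RRF constant k = 60 (a commonly used default), the 7-day base stability, the way importance stretches that stability, and all function and field names.

```typescript
// Layer 2 sketch: Reciprocal Rank Fusion.
// Each ranked list contributes 1 / (k + rank) to a document's fused score.
// k = 60 is an assumed default; the source does not state the value used.
function reciprocalRankFusion(rankings: string[][], k = 60): Array<[string, number]> {
  const scores = new Map<string, number>();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      const rank = index + 1; // ranks are 1-based
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank));
    });
  }
  // Highest fused score first.
  return Array.from(scores.entries()).sort((a, b) => b[1] - a[1]);
}

// Layer 4 sketch: Ebbinghaus-style retention, R = exp(-t / S).
interface Memory {
  id: string;
  importance: number; // 0..0.95
  lastAccessedMs: number; // epoch milliseconds of the last retrieval
}

const IMPORTANCE_STEP = 0.05; // from the description above
const IMPORTANCE_CAP = 0.95; // from the description above
const BASE_STABILITY_DAYS = 7; // hypothetical value, for illustration only
const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Retention decays exponentially with time since last access.
// Assumption: higher importance lengthens the stability S.
function retention(memory: Memory, nowMs: number): number {
  const elapsedDays = Math.max(0, nowMs - memory.lastAccessedMs) / MS_PER_DAY;
  const stabilityDays = BASE_STABILITY_DAYS * (1 + memory.importance);
  return Math.exp(-elapsedDays / stabilityDays);
}

// Feedback loop: each retrieval bumps importance by the step, up to the cap,
// and resets the decay clock.
function reinforce(memory: Memory, nowMs: number): Memory {
  const importance = Math.min(IMPORTANCE_CAP, memory.importance + IMPORTANCE_STEP);
  return { ...memory, importance, lastAccessedMs: nowMs };
}
```

In this sketch the BM25 and pgvector result lists would be passed to reciprocalRankFusion as two arrays of memory IDs, and reinforce would be called on whichever memories are actually returned to the agent.
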
LLM Routing
Local-First Fallback Chain
Ollama → Anthropic → Gemini → OpenAI. Free local inference first, cost tracking per call.
Agent Framework
10 Persona Agents
Specialized agents with soul configs, composable skills, A2A protocol, multi-channel routing
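
The local-first fallback chain described under LLM Routing can be sketched as a loop over providers in priority order. This is a minimal illustration under stated assumptions, not the routing code Lumen runs: the Provider interface, the flat per-call cost field, and the completeWithFallback name are all hypothetical, and real providers would wrap their own SDK or HTTP calls inside complete.

```typescript
// Hypothetical provider shape; each real provider would wrap its own client.
interface Provider {
  name: string;
  costPerCallUsd: number; // assumed flat per-call cost, for illustration only
  complete: (prompt: string) => Promise<string>;
}

interface RoutedResult {
  provider: string; // the provider that answered
  text: string;
  costUsd: number; // cost recorded for this call
  attempts: string[]; // providers tried, in order
}

// Try each provider in order; the first one that succeeds wins.
// Listing the local model first means cloud providers are only used on failure.
async function completeWithFallback(
  providers: Provider[],
  prompt: string,
): Promise<RoutedResult> {
  const attempts: string[] = [];
  for (const provider of providers) {
    attempts.push(provider.name);
    try {
      const text = await provider.complete(prompt);
      return { provider: provider.name, text, costUsd: provider.costPerCallUsd, attempts };
    } catch {
      // This provider failed; fall through to the next one in the chain.
    }
  }
  throw new Error(`All providers failed: ${attempts.join(" -> ")}`);
}
```

With the order given above, the providers array would list Ollama first at zero cost, followed by Anthropic, Gemini, and OpenAI. The attempts field records which fallbacks were exercised on each call.
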

Technologies in Production

Not technologies I've read about. Technologies running in production right now, serving real workloads.

TypeScript / Node.js
Python / FastAPI
Next.js / React
PostgreSQL / pgvector
Redis
Neo4j
Ollama / Gemma / LLaMA
Claude / GPT / Gemini APIs
Voyage AI Embeddings
React Flow
Cloudflare (Pages/Tunnel)
Railway
Stripe Billing
MCP Protocol
Docker / launchd
Drizzle ORM
Orpheus / Edge TTS
Playwright Testing

How I Think

Every system I build starts as a personal pain point, gets hardened to production, then becomes proof of architectural thinking.

Build AI systems that solve real problems in your own life. If it survives your family, it'll survive enterprise.
Principle
Local-First
Run on your hardware. Cloud is fallback, not dependency. Own your data, own your inference.
Principle
Ship, Don't Demo
Every project runs in production with real users. Tutorials prove you can follow instructions. Production proves you can build.
Principle
Recursive Proof of Work
Personal pain → production system → open source → career evidence. The process itself is the product.