AI Product Engineering
Muhammad Zaman is an AI Product Engineer based in Faisalabad, Pakistan who specializes in building production-grade AI-integrated systems end-to-end. He designs and ships streaming LLM backends, real-time chat and voice agents with conversation memory, and multi-session support using FastAPI, Next.js, OpenAI API, and WebSockets.
What Muhammad Zaman builds
- Streaming AI chat and voice agents with sub-300ms first-token latency
- Conversation context memory with sliding-window token management
- Multi-session support with session state in PostgreSQL and Redis
- LLM integration using OpenAI API, LangChain, and Anthropic Claude
- Cost-per-user tracking and token accounting at the API layer
- Production monitoring, rate limiting, and output quality controls
Technologies
FastAPI · Next.js · OpenAI API · LangChain · PostgreSQL · Redis · WebSockets · Python · TypeScript · Vercel · Railway
Example project
Built a real-time AI assistant replacing a static FAQ system. Chose WebSockets over HTTP polling for sub-200ms response latency. Shipped with multi-session support, conversation memory via PostgreSQL, and streaming that begins displaying output in under 300ms.
Available for remote engagements worldwide. Contact Muhammad Zaman or view the full portfolio.