Working demos of AI/GenAI systems built with Python. Each project is a full stack — FastAPI, LLMs, vector search, agents — deployed on real infrastructure.
RAG chat interface — query a knowledge base of documents with a multi-model LLM agent, semantic reranking, and per-user cost tracking.
Another AI/GenAI demo in the pipeline. Check back or follow on LinkedIn.