Multi-Agent Research Assistant
Architected a 4-node LangGraph agent graph (router → retriever → analyzer → synthesizer) that classifies query intent, retrieves relevant document chunks, filters noise, and generates citation-backed answers. Built a RAG pipeline using ChromaDB for persistent vector storage and Ollama's nomic-embed-text for local embedding generation. Implemented a relevance scoring layer that ranks chunks by cosine similarity and filters below-threshold results before passing context to the LLM. Served through a FastAPI endpoint with streaming support, returning the answer, source metadata, and a step-by-step agent execution trace.