Project Overview
Objective
Built PDF upload and natural language QA system with retrieval-augmented generation.
Stack
FastAPIReactPyPDFLoaderRecursiveCharacterTextSplitterHuggingFaceEmbeddingsFAISSGPT-4o-miniGPT-4.1GPT-5
Delivery highlights
- Developed a document QA workflow where users upload PDF files and ask questions in natural language, Processed documents with PyPDFLoader and RecursiveCharacterTextSplitter before embedding, Generated semantic embeddings with HuggingFaceEmbeddings (BAAI/bge-m3) and stored them in FAISS for similarity retrieval, and Provided retrieved chunks as context to selectable LLMs (GPT-4o-mini, GPT-4.1, GPT-5) through FastAPI + React UI.