I built DocQuify because I kept running into the same problem: I'd have a 60-page PDF open and need one specific thing from it. Ctrl+F only gets you so far when you don't know the exact wording.
The idea was straightforward: let people just ask their documents questions in normal language and get answers pulled from the actual content. Not summaries, not guesses. Specific answers with the context behind them.
The technical part that took the most thought was the chunking strategy. Splitting a PDF into pieces sounds simple until you realize that meaning often spans across paragraphs. A naive split loses that. I ended up using overlapping chunks so context doesn't get cut off at boundaries, which made a real difference in answer quality.
From there, each chunk gets embedded via OpenAI, stored in a vector DB, and when someone asks a question, their query gets embedded too and matched against the stored chunks semantically. The closest chunks go into the prompt as context, and the model answers from those rather than general knowledge.