SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file and perform a semantic search on contents.
Updated Apr 4, 2024 · TypeScript
Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
Snappy: Vision-first document retrieval using ColPali embeddings - Search PDFs with FastAPI, Next.js 16, Qdrant, and React 19.2
Python program for searching PDF text, ranking the results, and exporting highlighted search results as a PDF. Uses a trie, stack, heap, and page graph. Converts queries to postfix notation. Supports logical expressions and phrases. Offers "did you mean" suggestions.
DocuVisQA (Document Visual Question Answering) is a Python project that leverages Google's Generative AI and LangChain for document processing, text splitting, and question answering. It also supports image processing, with Streamlit providing an interactive UI.
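The most full-featured PDF component for Vue: supports rendering, page-number extraction and navigation, file-load-complete events, page-change events, text search, keyword highlighting, and table-of-contents extraction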
Use semantic search on PDFs locally
CLI for merging PDF contexts.
In Development
PDF Parser built in Rust
A web interface that allows searching for PDFs by their content
Given a set of PDFs and a query, finds the most relevant PDF using TF-IDF. TF-IDF is implemented without any library.
A document indexing daemon that can populate Elasticsearch indexes with the contents and metadata of a number of document types, including PDFs, image scans, etc. Used to power Facile Search, but can be reused for anything that requires search indexing for scanned documents.
A tool to search for text in PDF files using multiple methods, including OCR (Optical Character Recognition).
Program that searches for a list of names of parties to legal proceedings in the PDFs of the Diário Oficial (official gazette).
Cognivia AI is a powerful AI-powered PDF search and question-answering system built with LangChain, Pinecone Vector Store, OpenAI, and Supabase. Upload PDFs, ask questions, and get intelligent answers with persistent conversation memory.
Resume search application using OpenAI RAG and file search. A demo application that shows how RAG from OpenAI can simplify resume screening. An open-source VLM model example is to follow.
Short on time? Can't search through all your PDFs one by one for the content you want? Well, PDF-Founder is here...
🚀 Revolutionize your data interaction with a cutting-edge chatbot built on Retrieval-Augmented Generation (RAG) and OpenAI’s GPT-4. Upload documents, create custom knowledge bases, and get precise, contextual answers. Ideal for research, business operations, customer support, and more!
A high-performance RAG system for PDFs using multi-vector embeddings (ColPali / ColQwen / ColSmol) with vector search in Qdrant, prefetch optimization, and reranking for improved relevance. Designed for speed, accuracy, and scalability, this system is ideal for building intelligent search, document understanding, and QA applications.
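One of the entries above describes finding the most relevant PDF for a query with TF-IDF implemented without any library. As a rough illustration of that idea (not that repository's actual code), the sketch below ranks already-extracted document text using only the Python standard library. The function names `tokenize` and `rank_documents`, the whitespace tokenizer, and the plain `log(N / df)` weighting are all assumptions made for this example; PDF text extraction is left out.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def rank_documents(docs, query):
    """Rank documents by summed TF-IDF of the query terms.

    docs maps a document name to its extracted text (hypothetical input;
    extracting text from the PDFs is assumed to have happened already).
    Returns (name, score) pairs sorted from most to least relevant.
    """
    tokenized = {name: tokenize(text) for name, text in docs.items()}
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized.values():
        df.update(set(tokens))

    query_terms = tokenize(query)
    scores = {}
    for name, tokens in tokenized.items():
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty text
        score = 0.0
        for term in query_terms:
            if term in counts:
                tf = counts[term] / total          # term frequency
                idf = math.log(n_docs / df[term])  # inverse document frequency
                score += tf * idf
        scores[name] = score

    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    sample = {
        "a.pdf": "cats chase mice",
        "b.pdf": "dogs chase cars",
        "c.pdf": "stock market report",
    }
    for name, score in rank_documents(sample, "cats mice"):
        print(name, round(score, 3))
```

With this weighting, a term that appears in every document gets an IDF of zero and contributes nothing to the score, which is why smoothed variants are often preferred in practice.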