Explore projects
-
Updated
-
Updated
-
SciBot is an intelligent, RAG-based research assistant designed to retrieve and answer questions from PDF documents with high accuracy. This system combines the power of LLMs (Large Language Models) with a vector database to perform context-aware question answering directly from PDF data, making it ideal for students, researchers, and professionals.
๐ง Key Features:
๐ PDF Ingestion & Parsing: Automatically reads and extracts structured text content from uploaded PDF files using tools like PyMuPDF or pdfplumber.๐งน Text Preprocessing: Cleans the extracted text, chunks it into semantically meaningful passages, and removes irrelevant formatting or metadata.
๐ Embedding Generation: Converts text chunks into numerical vector embeddings using OpenAI Embeddings API, HuggingFace models, or Instructor embeddings.๐ ๏ธ Vector Database Storage: Stores these embeddings in a fast vector database like ChromaDB, FAISS, or Pinecone, enabling efficient semantic retrieval.๐ค RAG-based Question Answering:When a user asks a question, relevant chunks are retrieved from the vector DB.
These are passed along with the query to an LLM (like GPT-4, OpenAI GPT-3.5, or Gemini).
The model then generates accurate, grounded responses using the retrieved context.
๐งช Research-Oriented UI:
Built with Streamlit for a fast, minimal interface.
Users can upload PDFs, type questions, and get natural language answers.
Highlighted references or citations from the PDF
Updated -
Takkelapati Venika Chowdary / ReadCycle
MIT LicenseUpdated -
Updated
-
Devarashetty Saiteja / FAKE-NEWS-DETECTOR
MIT LicenseUpdated -
Updated
-
Updated
-
Takkelapati Abhinaya Chowdary / ReadCycle
MIT LicenseUpdated -
Updated
-