Selected Projects

I enjoy making things. Here are a selection of projects that I have worked on over the years.

RAG-TA

RAG-TA

RAG-TA: RAG-based Intelligent Teaching Assistant System The RAG Intelligent Teaching Assistant System is an intelligent teaching assistance platform based on Multimodal Retrieval-Augmented Generation (Multimodal RAG), specifically designed for educational scenarios. The system integrates the following core functions: ๐Ÿค– Intelligent Question Answering System Multimodal Understanding: Supports complex question answering with text and images, capable of understanding charts, formulas, and image content in course materials. Contextual Retrieval: Precise semantic retrieval based on the ChromaDB vector database. Hybrid Retrieval: Combines dense vector retrieval and sparse retrieval (BM25) to improve retrieval accuracy. Source Tracing: Automatically annotates the source file and page number of the answer, ensuring information traceability. ๐Ÿ“š Knowledge Base Management Multi-format Support: Supports various formats including PDF, PPTX, DOCX, TXT, MD, and images. Intelligent Indexing: Automatically extracts text and image content to build a multimodal vector index. Incremental Updates: Intelligently detects file changes and updates only the changed parts, improving efficiency. Folder Management: Supports hierarchical directory structure for easy organization of course materials. ๐Ÿ’ฌ Conversation Management History: Complete conversation history saving and management. Multiple Answer Options: Supports multiple answer versions for the same question, allowing users to switch between them. Folder Classification: Supports conversation folder management for easy course classification. Thinking Mode: Visualizes the AI reasoning process to help understand the answer logic. ๐Ÿ–ผ๏ธ Multimodal Interaction Image Understanding: Automatically extracts and describes image content in PDFs/PPTXs. Real-time Upload: Supports users uploading images and documents for instant question answering. Visual Question Answering: Provides comprehensive answers combining image content and text knowledge.