Spaces:
Running
Running
File size: 2,199 Bytes
e18a9c5 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 |
---
title: CedroPM Bot
emoji: π€
colorFrom: blue
colorTo: indigo
sdk: docker
pinned: false
app_port: 7860
---
# PM-RAG-ChatBot (CedroPM)
A specialized RAG (Retrieval Augmented Generation) chatbot designed for Project Managers and Software Architects. It processes Software Requirements Documents (SRDs) and provides expert-level analysis by combining project-specific data with a pre-seeded "Gold Standard" knowledge base of industry best practices.
## π Key Features
* **π Intelligent SRD Analysis**: Upload PDF requirements and get answers grounded strictly in your document.
* **π§ Global Expert Knowledge**: Pre-seeded with 30+ industry standards (ISO, OWASP, NIST, Agile) to provide "Senior Architect" advice even when your PDF is silent.
* **π¨ Diagram Vision**: Understands and explains architecture diagrams using Claude Vision or Qwen2-VL.
* **π Hybrid Search**: Uses a dual-scope retrieval engine to search your **Project PDF** and **Global Best Practices** simultaneously.
* **π¬ Interactive Chat**: Built with Chainlit for a clean, chat-like interface with history retention.
* **π Secure & Private**: Multi-user isolation ensures one project's data never leaks to another.
## ποΈ System Architecture
1. **Ingestion Layer**:
* **PDFs**: Processed with `pdfplumber` and `SmartSRDSplitter` (Section-aware chunking).
* **Tables**: Extracted via `camelot` and converted to Markdown.
* **Diagrams**: Processed via OCR (`Tesseract`) or Vision LLM (`Claude 3.5 Sonnet`).
2. **Knowledge Store**:
* **ChromaDB**: vector storage for semantic search.
* **Dual-Scope**: Queries filter by `(User + Project)` OR `(Global_Expert_Knowledge)`.
3. **Reasoning Engine**:
* **Claude 3.5 Sonnet**: Generates detailed, structured responses.
* **Seed Data**: A Python script injects "Golden Rules" for Security, DevOps, and PM methodologies.
## π Prerequisites
* **Python 3.10+**
* **System Tools**:
* `Poppler` (for PDF rendering)
* `Tesseract OCR` (for image text extraction)
* **API Key**: Anthropic API Key (for Claude).
## π§ Installation & Setup
### 1. Clone the Repository
```bash
git clone <your-repo-url>
cd PM-RAG-ChatBot
|