Figure 1: Chatbot Pipeline
1. User Input
The user supplies either a query on its own or a query together with a PDF.
2. PDF Attached? (Decision Node)
Yes: • The document is classified as scanned or digitally generated.
• The appropriate text extraction (OCR for scanned pages, direct parsing for digital PDFs) is performed; see the routing sketch below.
• The extracted text is then normalized and passed to the Graph RAG & Structuring Agent.
No (Query Only) → Use Graph RAG: • If no PDF is attached, the query goes directly to the Graph RAG & Structuring Agent.
• Relevant financial data is pulled from its knowledge base.
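A minimal sketch of this routing logic is given below, assuming pypdf for the digital text layer and pdf2image plus pytesseract for the OCR branch; extract_document_text, is_scanned, and normalize are illustrative helper names, not an existing API.

from pypdf import PdfReader

def is_scanned(pdf_path: str, min_chars_per_page: int = 20) -> bool:
    # Heuristic: if the PDF yields almost no embedded text, treat it as scanned.
    reader = PdfReader(pdf_path)
    total_chars = sum(len(page.extract_text() or "") for page in reader.pages)
    return total_chars < min_chars_per_page * max(len(reader.pages), 1)

def extract_document_text(pdf_path: str) -> str:
    if is_scanned(pdf_path):
        # OCR branch: rasterize each page and run Tesseract on the image.
        from pdf2image import convert_from_path
        import pytesseract
        pages = convert_from_path(pdf_path)
        text = "\n".join(pytesseract.image_to_string(img) for img in pages)
    else:
        # Digital branch: read the embedded text layer directly.
        reader = PdfReader(pdf_path)
        text = "\n".join(page.extract_text() or "" for page in reader.pages)
    return normalize(text)

def normalize(text: str) -> str:
    # Placeholder normalization: collapse whitespace; a real pipeline would do more.
    return " ".join(text.split())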
3. Graph RAG & Structuring Agent
Gathers and structures relevant data (tables, market info, regulatory filings, etc.) based on:
• The user’s query alone, or
• The combination of the query + extracted document text.
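The agent's interface can be sketched as follows; GraphStore and its query() method are hypothetical stand-ins for whatever knowledge-graph or vector backend is ultimately chosen, and the bucket names mirror the data types listed above.

from dataclasses import dataclass, field
from typing import Protocol

class GraphStore(Protocol):
    def query(self, text: str, top_k: int = 5) -> list[dict]: ...

@dataclass
class StructuredContext:
    tables: list[str] = field(default_factory=list)
    filings: list[str] = field(default_factory=list)
    market_data: list[str] = field(default_factory=list)

def graph_rag_structuring_agent(query: str, doc_text: str | None,
                                store: GraphStore) -> StructuredContext:
    # Retrieve against the query alone, or the query enriched with document text.
    retrieval_input = query if doc_text is None else f"{query}\n{doc_text}"
    hits = store.query(retrieval_input, top_k=5)

    # Route each retrieved item into a labeled bucket for the merge step.
    ctx = StructuredContext()
    for hit in hits:
        bucket = getattr(ctx, hit.get("type", "market_data"), ctx.market_data)
        bucket.append(hit.get("text", ""))
    return ctx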
4. Merge Processed Doc & Query
Combines all text and context into a single prompt for the next step.
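A minimal sketch of the merge step, reusing StructuredContext from the Graph RAG sketch above: when no PDF was supplied, doc_text is simply None and the function degrades to a pass-through that combines the query with the retrieved context.

def merge_prompt(query: str, context: StructuredContext,
                 doc_text: str | None = None) -> str:
    sections = [f"User question:\n{query}"]
    if doc_text:
        sections.append(f"Attached document (extracted text):\n{doc_text}")
    if context.tables:
        sections.append("Relevant tables:\n" + "\n".join(context.tables))
    if context.filings:
        sections.append("Relevant filings:\n" + "\n".join(context.filings))
    if context.market_data:
        sections.append("Market data:\n" + "\n".join(context.market_data))
    return "\n\n".join(sections)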
5. Finance LLM Agent
Generates the response using the merged prompt, leveraging domain-specific financial training.
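A hedged sketch of the Finance LLM Agent using a Hugging Face causal language model; "finance-llm-checkpoint" is a placeholder name rather than a real checkpoint, and device_map="auto" assumes the accelerate package is installed.

from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "finance-llm-checkpoint"  # placeholder for the selected finance LLM
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, device_map="auto")

def finance_llm_agent(merged_prompt: str, max_new_tokens: int = 512) -> str:
    inputs = tokenizer(merged_prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Strip the prompt tokens so only the newly generated answer is returned.
    answer_ids = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(answer_ids, skip_special_tokens=True)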
6. Feedback Loop
Collects user input on the quality or accuracy of the response, used for iterative improvements.
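One simple way to capture this feedback is to append each rating to a JSONL log that later evaluation or fine-tuning jobs can consume; the file name and fields below are illustrative.

import json
import time

def record_feedback(query: str, answer: str, rating: int, comment: str = "",
                    path: str = "feedback_log.jsonl") -> None:
    entry = {
        "timestamp": time.time(),
        "query": query,
        "answer": answer,
        "rating": rating,    # e.g., 1-5 stars, or thumbs up/down mapped to 1/0
        "comment": comment,
    }
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")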
7. Enhanced Output
Returns the final answer to the user, incorporating relevant context and data.
Design Considerations
Single, Consistent Flow
• Maintaining one path through the pipeline (Graph RAG → Merge → LLM) avoids branching logic
that can complicate maintenance.
• If there’s no PDF, the “Merge” step simply merges the user’s query with the Graph RAG–retrieved
data, acting as a pass-through.
Modular Extensibility
• Later, you may add other optional data sources (e.g., user profile, previously uploaded documents, or
real-time market data). The “Merge” block is a natural place to combine them.
• Having a single merge node means no separate path is needed for the “no document” case.
Simplified Code and Orchestration
• Splitting the pipeline into separate routes (one bypassing “Merge” and one that doesn’t) introduces
extra branching or code paths.
• By treating “no PDF data” as an empty or null input, the “Merge” step still processes the user query
plus whatever Graph RAG context is available.
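Tying the earlier sketches together, both request types can flow through a single answer() function; the only difference is whether doc_text is populated, so no separate "no document" route is needed.

def answer(query: str, pdf_path: str | None, store: GraphStore) -> str:
    doc_text = extract_document_text(pdf_path) if pdf_path else None
    context = graph_rag_structuring_agent(query, doc_text, store)
    prompt = merge_prompt(query, context, doc_text)
    return finance_llm_agent(prompt)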
Overview of the Model Selection & Optimization Approach
1. Multimodal OCR
• Two Pre-Trained Models: Select two leading OCR solutions (e.g., from research papers or industry
benchmarks).
• Evaluation: Use a financially annotated dataset (tables, financial terms) to measure key metrics such
as Character/Word Error Rate and table-structure accuracy.
• Model Selection: Choose the model with the best overall performance (lowest errors, highest quality
output) based on standardized evaluation methods.
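For the error-rate part of this evaluation, a hedged sketch using the jiwer package is shown below; run_ocr_model is a placeholder for whichever candidate model is under test, and table-structure accuracy would need a separate scorer not shown here.

from jiwer import wer, cer

def evaluate_ocr(run_ocr_model, samples: list[tuple[str, str]]) -> dict:
    # samples: list of (image_path, ground_truth_text) pairs from the annotated set.
    hypotheses, references = [], []
    for image_path, ground_truth in samples:
        hypotheses.append(run_ocr_model(image_path))
        references.append(ground_truth)
    return {
        "wer": wer(references, hypotheses),  # Word Error Rate
        "cer": cer(references, hypotheses),  # Character Error Rate
    }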
2. Finance LLM
• Two Pre-Trained Finance Models: Identify two specialized large language models tuned for financial text.
• Testing Methods: Compare their domain-specific accuracy (factual consistency, clarity) using recognized finance NLP benchmarks.
• Best Model Choice: Select the LLM with superior performance on a set of financial queries or tasks.
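An illustrative comparison harness is sketched below: each candidate is scored by exact-match accuracy on a small labeled set of financial questions; the generate_answer callables and question format are assumptions, not a specific benchmark.

def accuracy(generate_answer, questions: list[dict]) -> float:
    # questions: list of {"prompt": str, "answer": str} items.
    correct = sum(
        generate_answer(q["prompt"]).strip().lower() == q["answer"].strip().lower()
        for q in questions
    )
    return correct / len(questions)

def pick_best_model(candidates: dict, questions: list[dict]) -> str:
    # candidates maps a model name to its generate_answer callable.
    scores = {name: accuracy(fn, questions) for name, fn in candidates.items()}
    return max(scores, key=scores.get)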
3. Pipeline Optimization
• Efficiency Focus: Use architectures and techniques that minimize GPU/CPU consumption (e.g.,
quantization, pruning).
• Goal: Maintain strong performance for both OCR and LLM while reducing inference costs and resource
usage.
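As one concrete example, the selected finance LLM could be loaded in 4-bit precision via transformers and bitsandbytes, cutting VRAM use substantially; the checkpoint name is again a placeholder.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,  # 4-bit weights, fp16 compute
)

model = AutoModelForCausalLM.from_pretrained(
    "finance-llm-checkpoint",          # placeholder model name
    quantization_config=quant_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("finance-llm-checkpoint")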
Hardware
– CPU: 8–12 cores for efficient preprocessing and orchestration.
– Memory: 32 GB of system RAM to manage resources effectively.
– GPU: A high-memory GPU with at least 16–24 GB of VRAM (e.g., NVIDIA RTX 3090, RTX A5000, or A6000) is recommended for fast inference and for handling large model parameters.
– Storage: Fast NVMe SSD (500 GB or larger) for quick model loading and data caching.