Services Overview (Retriever and Vectorization)

A) Retriever service:
This service acts as a document retrieval and question-answering system that:
1. Takes a user query
2. Searches through indexed documents in Pinecone
3. Uses GPT-4 to generate answers based on relevant document chunks
4. Returns structured results with source information

Components and functionality:
1. Imports and Dependencies (key libraries used):
• pinecone: For vector database operations
• LangChain: For working with LLMs and document processing
• FastAPI: For creating the web API
• pandas: For data manipulation
2. Main Components:
a) Helper Function:
def extract_unique_sources(query_result):
• This function extracts unique document IDs from Pinecone query results
• It processes the matches from the query result and collects the unique document IDs
b) API Setup:
• Uses the FastAPI framework
• Creates a router with the prefix "/api/v1"
• Exposes two endpoints (a minimal sketch of the helper and the router setup follows this list):
  o /api/v1/steps/retriever (POST)
  o /api/v1/health (GET)
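A minimal sketch of the helper function and the router setup, assuming a dict-like Pinecone query result whose matches carry a documentId metadata key (that key name, and the handler signatures, are assumptions):

from fastapi import APIRouter

def extract_unique_sources(query_result):
    # Collect unique document IDs from the matches of a Pinecone query result.
    # Assumes each match exposes a metadata dict with a "documentId" key.
    unique_ids = set()
    for match in query_result.get("matches", []):
        doc_id = match.get("metadata", {}).get("documentId")
        if doc_id:
            unique_ids.add(doc_id)
    return list(unique_ids)

router = APIRouter(prefix="/api/v1")

@router.post("/steps/retriever")
def retriever(payload: dict):
    ...  # core retrieval flow, described in sections 3-5 below

@router.get("/health")
def health():
    return {"status": "ok"}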
3. Main Retriever Endpoint (/api/v1/steps/retriever):
This endpoint provides the core functionality. It:
• Accepts POST requests with a JSON body containing (an example request model is sketched below):
  o query: the user's question
  o restrictToDocumentIds: a list of specific document IDs to restrict the search to
  o documentType: type of document to filter on
  o documentcategory: category of document to filter on
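A hedged sketch of the request body as a Pydantic model; the field names follow the list above, while their types, optionality, and the example values are assumptions:

from typing import List, Optional
from pydantic import BaseModel

class RetrieverRequest(BaseModel):
    query: str
    restrictToDocumentIds: Optional[List[str]] = None
    documentType: Optional[str] = None
    documentcategory: Optional[str] = None

# Illustrative request (placeholder values)
example_request = RetrieverRequest(
    query="What is the termination notice period?",
    restrictToDocumentIds=["doc-123"],
    documentType="contract",
    documentcategory="legal",
)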
4. Configuration:
• Uses environment variables for the various credentials (a minimal sketch follows this list):
  o OpenAI/Azure credentials (API base, key, version, type)
  o Pinecone credentials (API key, environment, index name)
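A minimal sketch of how these credentials might be read from the environment; the exact variable names are assumptions based on the list above:

import os

# Azure/OpenAI credentials (variable names are illustrative)
OPENAI_API_BASE = os.getenv("OPENAI_API_BASE")
OPENAI_API_KEY = os.getenv("OPENAI_API_KEY")
OPENAI_API_VERSION = os.getenv("OPENAI_API_VERSION")
OPENAI_API_TYPE = os.getenv("OPENAI_API_TYPE", "azure")

# Pinecone credentials (variable names are illustrative)
PINECONE_API_KEY = os.getenv("PINECONE_API_KEY")
PINECONE_ENVIRONMENT = os.getenv("PINECONE_ENVIRONMENT")
PINECONE_INDEX_NAME = os.getenv("PINECONE_INDEX_NAME")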
5. Core Processing Flow (a condensed sketch appears after this list):
a) Setup:
• Initializes the Azure OpenAI LLM (GPT-4)
• Sets up OpenAI embeddings (using the Ada model)
• Initializes the Pinecone connection
b) Document Retrieval:
• Queries the Pinecone index with filters based on:
  o document type
  o document category
  o specific document IDs
• Retrieves up to 1000 matches (configurable)
c) Processing:
For each unique document:
• Creates a RetrievalQA chain
• Runs the user's query through the chain
• Stores results in a pandas DataFrame with the columns:
  o documentId
  o documentCategory
  o documentType
  o fileName
  o result
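A condensed sketch of this flow, assuming the classic LangChain and legacy pinecone-client APIs. The deployment names, the metadata/filter keys, the .to_dict() conversion, and the wiring to the environment variables (section 4) and the extract_unique_sources helper (section 2) are assumptions:

import pinecone
import pandas as pd
from langchain.chat_models import AzureChatOpenAI
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import Pinecone as PineconeStore
from langchain.chains import RetrievalQA

def run_retriever(query, restrictToDocumentIds, documentType, documentcategory):
    # a) Setup: GPT-4 LLM, Ada embeddings, Pinecone connection
    llm = AzureChatOpenAI(deployment_name="gpt-4", openai_api_version=OPENAI_API_VERSION)
    embeddings = OpenAIEmbeddings(deployment="text-embedding-ada-002", chunk_size=1)
    pinecone.init(api_key=PINECONE_API_KEY, environment=PINECONE_ENVIRONMENT)
    index = pinecone.Index(PINECONE_INDEX_NAME)

    # b) Document retrieval: filtered query, up to 1000 matches (configurable)
    filters = {}
    if documentType:
        filters["documentType"] = documentType
    if documentcategory:
        filters["documentCategory"] = documentcategory
    if restrictToDocumentIds:
        filters["documentId"] = {"$in": restrictToDocumentIds}
    query_result = index.query(
        vector=embeddings.embed_query(query),
        top_k=1000,
        filter=filters or None,
        include_metadata=True,
    ).to_dict()  # dict form expected by extract_unique_sources (assumption)

    # c) Processing: one RetrievalQA run per unique document
    rows = []
    store = PineconeStore.from_existing_index(PINECONE_INDEX_NAME, embeddings)
    for doc_id in extract_unique_sources(query_result):
        retriever = store.as_retriever(search_kwargs={"filter": {"documentId": doc_id}})
        chain = RetrievalQA.from_chain_type(llm=llm, chain_type="stuff", retriever=retriever)
        rows.append({
            "documentId": doc_id,
            "documentCategory": documentcategory,
            "documentType": documentType,
            "fileName": None,  # taken from chunk metadata in the real service
            "result": chain.run(query),
        })
    return pd.DataFrame(rows, columns=["documentId", "documentCategory", "documentType", "fileName", "result"])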
6. Output:
Returns a JSON response containing (an illustrative response shape is shown below):
• results: a list of processed documents and their answers
• errors: any internal errors that occurred
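An illustrative response shape, expressed as the Python dict the endpoint would serialize (all values are placeholders):

response = {
    "results": [
        {
            "documentId": "doc-123",
            "documentCategory": "legal",
            "documentType": "contract",
            "fileName": "contract.html",
            "result": "generated answer text",
        }
    ],
    "errors": [],
}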
7. Error Handling:
• Includes basic error handling for value errors (a minimal sketch follows this list)
• Returns appropriate HTTP status codes (200 for success, 400 for errors)
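A minimal sketch of this pattern, assuming FastAPI's JSONResponse and a hypothetical process_retriever_request helper that wraps the flow sketched in section 5:

from fastapi import APIRouter
from fastapi.responses import JSONResponse

router = APIRouter(prefix="/api/v1")

@router.post("/steps/retriever")
def retriever(payload: dict):
    try:
        # process_retriever_request is a hypothetical wrapper around the flow in section 5
        results, errors = process_retriever_request(payload)
        return JSONResponse(status_code=200, content={"results": results, "errors": errors})
    except ValueError as exc:
        return JSONResponse(status_code=400, content={"error": str(exc)})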

B) Vectorization service:
• Responsible for processing documents and converting them into vector embeddings for efficient retrieval.
• The main business purpose of this service is to:
1. Take HTML documents
2. Process them into searchable chunks
3. Convert these chunks into vector embeddings
4. Store them in a vector database for efficient semantic search
• This service works in conjunction with the retriever service:
  o This service (vectorization) prepares and stores the documents.
  o The retriever service uses these stored vectors to find relevant information when answering questions.

Components and functionality:
1. Service Overview:
This is a FastAPI-based service that converts documents (specifically
HTML documents) into vector embeddings and stores them in
Pinecone (a vector database). It has two main endpoints for
vectorization.
2. Main Components:
a) Setup and Configuration:
• Uses the FastAPI framework
• Loads environment variables from a .env file
• Sets up logging
• Exposes three endpoints (a minimal setup sketch follows this list):
  o /api/v1/steps/vectorize-new (POST)
  o /api/v1/steps/vectorize (POST)
  o /api/v1/health (GET)
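A minimal sketch of this setup (handler bodies are omitted; the logger name and handler signatures are illustrative):

import logging
from dotenv import load_dotenv
from fastapi import FastAPI, APIRouter

load_dotenv()  # load environment variables from the .env file
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("vectorization")

app = FastAPI()
router = APIRouter(prefix="/api/v1")

@router.post("/steps/vectorize-new")
def vectorize_new(payload: dict):
    ...  # structured vectorization flow (section 3)

@router.post("/steps/vectorize")
def vectorize(payload: dict):
    ...  # legacy vectorization flow (section 4)

@router.get("/health")
def health():
    return {"status": "ok"}

app.include_router(router)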
3. First Vectorization Endpoint (/api/v1/steps/vectorize-new):
This is the newer version of the vectorization endpoint and uses a more structured approach (an example request body is shown after this list):
a) Input Processing:
• Accepts JSON with two main sections:
  o inputs: contains the file path, document ID, type, and category
  o config: contains configuration for embeddings and processing
b) Processing Flow:
1. Validates credentials (OpenAI and Pinecone)
2. Sets up Azure OpenAI embeddings
3. Loads the HTML file from blob storage
4. Uses a Vectorizer class to:
• Load and process chunks
• Augment vectors with metadata
• Save vectors to the database
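An illustrative request body for this endpoint, expressed as a Python dict; the key names under inputs and config are assumptions based on the description above, and all values are placeholders:

payload = {
    "inputs": {
        "pathToInputFile": "documents/handbook.html",  # assumed key name, placeholder path
        "documentId": "doc-123",
        "documentType": "policy",
        "documentcategory": "hr",
    },
    "config": {
        "embeddingModel": "text-embedding-ada-002",  # assumed configuration keys
        "chunkSize": 1000,
        "chunkOverlap": 100,
    },
}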
4. Second Vectorization Endpoint (/api/v1/steps/vectorize):
This is the legacy version that handles the vectorization process directly (a condensed sketch follows this list):
a) Input Processing: accepts a simpler JSON body with:
• instanceId
• pathToInputFile
• documentType
• documentId
• documentcategory
b) Processing Flow:
1. Validates credentials
2. Loads the HTML file from blob storage
3. Processes the document:
• Uses UnstructuredHTMLLoader to load the HTML
• Splits the text into chunks using RecursiveCharacterTextSplitter
• Adds metadata to each chunk
• Stores the vectors in Pinecone
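A condensed sketch of this legacy flow, assuming the classic LangChain loader/splitter APIs, the legacy pinecone-client, and azure-storage-blob for the download step; the connection-string variable, the container/blob split of pathToInputFile, the id scheme, and the metadata key names are assumptions:

import os
import tempfile
import pinecone
from azure.storage.blob import BlobServiceClient
from langchain.document_loaders import UnstructuredHTMLLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings

def vectorize_document(pathToInputFile, documentId, documentType, documentcategory):
    # 1. Download the HTML file from Azure Blob Storage to a temporary local path
    #    (assumes pathToInputFile is "<container>/<blob name>")
    blob_service = BlobServiceClient.from_connection_string(os.environ["AZURE_STORAGE_CONNECTION_STRING"])
    container, _, blob_name = pathToInputFile.partition("/")
    blob = blob_service.get_blob_client(container=container, blob=blob_name)
    local_path = os.path.join(tempfile.gettempdir(), os.path.basename(blob_name))
    with open(local_path, "wb") as f:
        f.write(blob.download_blob().readall())

    # 2. Load the HTML and split it into chunks
    docs = UnstructuredHTMLLoader(local_path).load()
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
    chunks = splitter.split_documents(docs)

    # 3. Embed each chunk and upsert it into Pinecone with its metadata
    embeddings = OpenAIEmbeddings(deployment="text-embedding-ada-002", chunk_size=1)
    pinecone.init(api_key=os.environ["PINECONE_API_KEY"], environment=os.environ["PINECONE_ENVIRONMENT"])
    index = pinecone.Index(os.environ["PINECONE_INDEX_NAME"])
    vectors = []
    for i, chunk in enumerate(chunks):
        metadata = {
            "documentId": documentId,
            "documentType": documentType,
            "documentCategory": documentcategory,
            "text": chunk.page_content,
        }
        vectors.append((f"{documentId}-{i}", embeddings.embed_query(chunk.page_content), metadata))
    index.upsert(vectors=vectors)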
5. Key Business Logic:
a) Document Processing:
• Documents are loaded from Azure Blob Storage
• HTML content is parsed and split into manageable chunks
• Each chunk is converted into a vector embedding
• Metadata (document type, ID, category) is attached to each chunk
b) Vector Storage:
• Uses Pinecone as the vector database
• Stores vectors with associated metadata for later retrieval (an illustrative stored record is shown below)
• Uses Azure OpenAI's embeddings model (Ada)
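For illustration, a single stored record in the legacy-client upsert format, pairing the chunk embedding with its metadata (the id scheme and the concrete values are assumptions):

# One (id, values, metadata) tuple per chunk, as accepted by index.upsert() in the legacy client
example_record = (
    "doc-123-0",             # <documentId>-<chunk index> (assumed id scheme)
    [0.012, -0.034, 0.088],  # truncated stand-in for the 1536-dimension Ada embedding
    {
        "documentId": "doc-123",
        "documentType": "policy",
        "documentCategory": "hr",
        "text": "first chunk of the HTML document...",
    },
)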
6. Error Handling:
Comprehensive error handling covers:
• Missing credentials
• File loading issues
• Processing errors
• Database operations
The service returns appropriate HTTP status codes and error messages (a minimal credential-check sketch follows).
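A minimal sketch of the credential check and error mapping, assuming FastAPI's HTTPException; the variable names, the specific status codes, and the reuse of the vectorize_document sketch from section 4 are assumptions:

import os
from fastapi import APIRouter, HTTPException

router = APIRouter(prefix="/api/v1")

def validate_credentials():
    # Fail fast when required credentials are missing (variable names are illustrative)
    required = ["OPENAI_API_KEY", "PINECONE_API_KEY", "PINECONE_ENVIRONMENT", "PINECONE_INDEX_NAME"]
    missing = [name for name in required if not os.getenv(name)]
    if missing:
        raise HTTPException(status_code=400, detail=f"Missing credentials: {', '.join(missing)}")

@router.post("/steps/vectorize")
def vectorize(payload: dict):
    validate_credentials()
    try:
        # vectorize_document is the legacy flow sketched in section 4
        vectorize_document(
            payload["pathToInputFile"],
            payload["documentId"],
            payload["documentType"],
            payload["documentcategory"],
        )
        return {"status": "success"}
    except Exception as exc:  # file loading, processing, or database errors
        raise HTTPException(status_code=500, detail=str(exc))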
7. Integration Points:
• Azure Blob Storage: For document storage
• Azure OpenAI: For generating embeddings
• Pinecone: For vector storage
• FastAPI: For API endpoints
