Interact with document using Gen AI

Are you tired of combing through long documents looking for answers to your questions? Ever wondered if you could get a summary of a lengthy document? Do you still generate FAQ(s) manually, and wouldn't it be more productive if you could click a button and generate FAQ(s) from a document? If you are looking for answers, this publication will help you get started. It will help you build a quick application that can interact with documents in a secure manner.

Published by: Kamal Sharma
Artificial intelligence has been touching our lives in one way or another, and more profoundly since the advent of cloud computing. The big tech companies have constantly leveraged the power of AI to deliver lower costs to customers, for example by suggesting changes to their cloud resources. The mini movie that gets generated on our iPhones or Android phones is also the result of an AI algorithm running in the background. However, the launch of ChatGPT, powered by GPT and owned by OpenAI, has made the underlying technology - Gen AI - much more accessible to everybody. Businesses around the world are brainstorming about which use cases they can apply the technology to.

One thing is certain: we have only scratched the surface, and the sky is the limit as we progress through the decade. Companies and businesses around the world, even if they do not want to, will be compelled to provide better customer experience and services. Imagine that a travel site today requires 50+ clicks to plan an itinerary, but with the power of Gen AI, a bot can plan everything for you with a few questions. This is just one example, and the potential is immense.

In this article, I would like to talk about a use case we solved: rather than having team members spend time reading through a document, we gave them a tool that can answer the specific questions they have about it. Instead of going through the document and using the compute of our own brains, we handed the work off to LLMs on AWS Bedrock and achieved a productivity boost.

Let's dive into the architecture.

The architecture uses a series of services across AWS:

API Gateway
ECS - for compute and the cache layer
Vector database - pickle files on S3
Foundation models such as Claude and Titan on AWS Bedrock
Amongst the libraries and components used, we used React for the frontend and FastAPI for the backend API, along with LangChain to interact with AWS Bedrock.
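
To make the later steps concrete, here is a minimal sketch of the shared plumbing, assuming boto3 and the langchain-community Bedrock wrappers; the region and model IDs are illustrative placeholders, not necessarily what ran in production:

    import boto3
    from langchain_community.embeddings import BedrockEmbeddings
    from langchain_community.llms import Bedrock

    # Single Bedrock runtime client shared by the embeddings model and the LLM.
    bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

    # Titan vectorizes text; Claude generates the answers.
    embeddings = BedrockEmbeddings(client=bedrock_runtime,
                                   model_id="amazon.titan-embed-text-v1")
    llm = Bedrock(client=bedrock_runtime,
                  model_id="anthropic.claude-v2",
                  model_kwargs={"max_tokens_to_sample": 1024})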

The architecture is pretty straightforward, as is the customer journey. There are two user stories associated with it: 1. uploading and preparing the document for interaction, and 2. interacting with the document.

Find below the steps of the customer journey for uploading a file and making it ready for interaction (a code sketch follows the list):

A user uploads a file.
The file gets stored in an S3 bucket.
A backend process is triggered to chunk the file using LangChain.
A conversational buffer is also used to keep the customer's prompts and responses.
The chunks are individually vectorized by invoking the Titan embeddings model.
FAISS is used as an in-memory vector database.
The vector data is serialized as a pickle file and stored in S3.
A notification is sent to the frontend that the file is ready for interaction.
The user starts asking questions in "natural language". Example: What is this document about?
In the background, another process is triggered to generate a summary of the document and generate FAQ(s), so that they are ready for the user.
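
A hedged sketch of the chunk-embed-pickle steps above, reusing the embeddings object from the earlier snippet; the bucket and key names are hypothetical, and serialize_to_bytes() is the langchain-community FAISS helper that pickles the index:

    import boto3
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from langchain_community.vectorstores import FAISS

    s3 = boto3.client("s3")

    def prepare_document(bucket: str, key: str) -> None:
        # Pull the uploaded file's text from S3.
        text = s3.get_object(Bucket=bucket, Key=key)["Body"].read().decode("utf-8")

        # Chunk it so each piece fits the embedding model's input size.
        splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
        chunks = splitter.split_text(text)

        # Embed every chunk with Titan and build the in-memory FAISS index.
        index = FAISS.from_texts(chunks, embeddings)

        # Serialize (pickle) the index and store it next to the document.
        s3.put_object(Bucket=bucket, Key=f"{key}.pkl",
                      Body=index.serialize_to_bytes())
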
The customer journey for interacting with the document is stated below (again with a code sketch after the list):

The customer asks questions in natural language.
A backend API vectorizes the question asked by the user.
FAISS, the in-memory vector database, loads the respective pickle file from S3 and performs a similarity search.
The similar texts, along with the prompt, are sent to the LLM - Claude on AWS Bedrock - to provide a response.
Since we use LangChain, a conversational buffer is maintained, keeping the context of all the conversations for the user.
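
A sketch of this path, again with hypothetical names and reusing the s3 client, embeddings, and llm objects from the earlier snippets; note that recent langchain-community versions require an explicit allow_dangerous_deserialization flag when loading pickled indexes, and the exact signature may vary by version:

    from langchain.chains import ConversationalRetrievalChain
    from langchain.memory import ConversationBufferMemory
    from langchain_community.vectorstores import FAISS

    # One buffer per user keeps the context of the whole conversation.
    memory = ConversationBufferMemory(memory_key="chat_history",
                                      return_messages=True)

    def answer_question(bucket: str, key: str, question: str) -> str:
        # Load the pickled index that prepare_document() stored earlier.
        blob = s3.get_object(Bucket=bucket, Key=f"{key}.pkl")["Body"].read()
        index = FAISS.deserialize_from_bytes(
            serialized=blob,
            embeddings=embeddings,
            allow_dangerous_deserialization=True,
        )

        # The chain vectorizes the question, runs the similarity search, and
        # sends the similar texts plus chat history to Claude on Bedrock.
        chain = ConversationalRetrievalChain.from_llm(
            llm=llm,
            retriever=index.as_retriever(search_kwargs={"k": 4}),
            memory=memory,
        )
        return chain.invoke({"question": question})["answer"]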

It's amazing how well the similarity search performs even when out-of-context questions are asked; it does its best to find a match. The system can also be made smarter by using Agents (Bedrock Agents) to communicate with other systems as well.

Why choose pickle files instead of using a vector database such as Weaviate or OpenSearch?

It's a valid question, and it's a matter of tradeoffs. When a user is interacting with a document, the context needs to be that document itself. Having a shared vector database such as OpenSearch would be much more nuanced in this case: a lot more work would be needed for authorization controls and for making sure the right context is picked. However, if you are building a knowledge base of documents that does not need AuthZ, or the data in the documents is not related at all, then using systems such as Amazon Kendra or vector DBs such as OpenSearch should not be an issue. In fact, moving to pickle files in such a case would not scale and would not be an ideal solution.

Hence, no one strategy fits everything.

How do we summarize or generate FAQ(s) for larger documents - 100 MB+?

It's a great question to ask as well. Unfortunately, all LLMs are restricted by their context window size. In order to generate a summary or FAQ(s), the entire document must be presented to the LLM so that an appropriate response can be generated, and here the context window size is a limiting factor. Claude fares a little better with its 100,000-token limit. Refer here.

There are certain model-specific strategies that can be employed to increase the effective context window. Another strategy could be an architecture like the one sketched below:
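
One common pattern along these lines is map-reduce summarization: summarize each chunk independently, then summarize the partial summaries. A minimal sketch using LangChain's built-in summarize chain, reusing the llm and splitter from the earlier snippets (chunk sizes are illustrative assumptions):

    from langchain.chains.summarize import load_summarize_chain
    from langchain_core.documents import Document

    def summarize_large(text: str) -> str:
        # Split into chunks that individually fit the context window.
        splitter = RecursiveCharacterTextSplitter(chunk_size=8000,
                                                  chunk_overlap=200)
        docs = [Document(page_content=c) for c in splitter.split_text(text)]

        # "map_reduce" summarizes each chunk, then combines the summaries.
        chain = load_summarize_chain(llm, chain_type="map_reduce")
        return chain.invoke({"input_documents": docs})["output_text"]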

However, this should not be done as a synchronous call, since the user might end up waiting forever. Above all, API Gateway has a timeout of 30 seconds at most, so the connection might time out.

To conclude, we looked at how an application can be architected to interact with documents, vectorizing them using the Titan model and keeping the context in a secure manner. There are also strategies for improving customer experience by employing caching, LangChain conversational buffers, and async operations such as summarizing and generating FAQ(s).
