Python Lists:
Built-in data type
mylist = ["apples", 1, True]
Can store values of different data types, e.g. str, int, bool
Are changeable (mutable): able to add, update, and delete values
Ordered means the items have a defined order that is preserved
Unordered means there is no specific order or sequence of items
Allows duplicates (mylist = ["apples", "cherry", "apples"])
Can also create a list using the list() constructor
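For example (list() takes a single iterable, hence the double parentheses):
mylist = list(("apples", 1, True))
print(mylist)  # ['apples', 1, True]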
To change item value:
thislist = ["apple", "banana", "cherry"]
thislist[1] = "blackcurrant"
Adding items to a list:
o thislist.append("orange") adds to the end
o thislist.insert(1, "orange") inserts at index 1
Append two lists:
o thislist.extend(tropical)
o list1 + list2
Extend a list with a different iterable type: thislist.extend(thistuple), where thistuple is a tuple
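Putting these together:
thislist = ["apple", "banana", "cherry"]
thislist.append("orange")       # add to the end
thislist.insert(1, "orange")    # add at index 1
tropical = ["mango", "pineapple"]
thislist.extend(tropical)       # append every item from another list
thistuple = ("kiwi",)
thislist.extend(thistuple)      # extend() accepts any iterable, not only lists
combined = thislist + tropical  # + concatenates two lists into a new list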
Removing items:
o thislist.remove("banana") removes by value
o pop() removes the last item (pop(i) removes the item at index i)
o clear() empties the list's contents, but the list itself still exists
o del mylist deletes the entire list
o del mylist[1] deletes the item at a specific index
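Each removal option in action:
mylist = ["apple", "banana", "cherry", "kiwi"]
mylist.remove("banana")  # remove by value -> ['apple', 'cherry', 'kiwi']
mylist.pop()             # remove the last item -> ['apple', 'cherry']
del mylist[1]            # remove the item at index 1 -> ['apple']
mylist.clear()           # empty the contents -> []
del mylist               # delete the list object itself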
Make a copy of a list:
o mylist.copy()
o mylist[:]
o list(mylist)
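All three produce an independent (shallow) copy; plain assignment only copies the reference:
a = ["apple", "banana"]
b = a            # NOT a copy: b and a are the same list object
c = a.copy()     # independent copy
a.append("kiwi")
print(b)         # ['apple', 'banana', 'kiwi'] -- changed along with a
print(c)         # ['apple', 'banana']         -- unaffected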
list comprehension: offers a shorter syntax to create a new list based on the values of an existing list.
It can replace the below code:
fruits = ["apple", "banana", "cherry", "kiwi", "mango"]
newlist = []
for x in fruits:
    if "a" in x:
        newlist.append(x)
print(newlist)
with:
newlist = [x for x in fruits if "a" in x]
print(newlist)
Python Tuple:
ordered
unchangeable/immutable, i.e. items cannot be added or deleted after a tuple is created. However, we can convert a tuple to a list using the list() function, perform add/update/remove operations, and convert the list back to a tuple using the tuple() function
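The round-trip workaround in full:
thistuple = ("apple", "banana", "cherry")
templist = list(thistuple)   # tuple -> list
templist.append("orange")    # add/update/remove is possible now
thistuple = tuple(templist)  # list -> tuple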
allows duplicates
thistuple = ("apple",)
print(type(thistuple))  # <class 'tuple'>
# NOT a tuple (no trailing comma, so this is just a str):
thistuple = ("apple")
print(type(thistuple))  # <class 'str'>
can contain different data types in one tuple
can create a tuple directly:
o mytuple = ("apple", "banana")
o thistuple = tuple(("apple", "banana", "cherry")) # note the double round-brackets
unpacking a tuple:
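Assign the tuple's values back into variables (the * syntax collects any leftover values into a list):
fruits = ("apple", "banana", "cherry")
(green, yellow, red) = fruits  # one variable per item
(green, *rest) = fruits        # rest becomes ['banana', 'cherry']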
Python Set:
unordered: myset = {"apple", "banana", "cherry"}
Once a set is created, you cannot change its items, but you can remove items and add new
items.
Duplicates are not allowed. The values True and 1 are considered the same value in sets and are treated as duplicates.
thisset = set(("apple", "banana", "cherry")) # note the double round-brackets
add items to set: thisset.add("orange")
add two sets: thisset.update(tropical)
add an iterable to set: thisset.update(mylist)
to remove an item from set: thisset.remove("banana")
The union() and update() methods join all items from both sets (union() returns a new set; update() changes the original in place).
The intersection() method keeps ONLY the items present in both sets.
The difference() method keeps the items from the first set that are not in the other set(s).
The symmetric_difference() method keeps all items EXCEPT those present in both sets.
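For example (sets are unordered, so printed order may vary):
a = {"apple", "banana", "cherry"}
b = {"cherry", "kiwi"}
print(a.union(b))                 # {'apple', 'banana', 'cherry', 'kiwi'}
print(a.intersection(b))          # {'cherry'}
print(a.difference(b))            # {'apple', 'banana'}
print(a.symmetric_difference(b))  # {'apple', 'banana', 'kiwi'}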
Python dictionary:
Dictionary items are ordered (as of Python 3.7), changeable, and do not allow duplicate keys.
Change/add values:
o thisdict.update({"key": value})
o thisdict["key"] = value
to remove item:
o thisdict.pop("model")
o del thisdict["model"]
o thisdict.clear()
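All of these in one example:
thisdict = {"brand": "Ford", "model": "Mustang", "year": 1964}
thisdict["color"] = "red"        # add a new key
thisdict.update({"year": 2020})  # change an existing value
thisdict.pop("model")            # remove by key (returns the value)
del thisdict["color"]            # also removes by key
thisdict.clear()                 # empty the dictionary -> {}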
Internship details (manual notes)
Task 1: comes under the oil and gas sector
- Created a chatbot using AI that answers user queries based on information available in a DB
- DB consists of PDF files, Word docs, and Excel files
- All file types are converted to PDF: Word to PDF using LibreOffice, Excel using LangChain unstructured document loaders. PDFs loaded using LangChain document loaders.
- Now all files are available in PDF format.
- GPT-4o used to extract PDF content and store it into LangChain docs
- Embeddings created for the entire extracted content using the text-embedding-ada-002 model and stored in a vector store
- Use RecursiveCharacterTextSplitter to split large docs while creating embeddings
- RAG pipeline created to respond to user queries
- Use conversational chain buffer memory to maintain chat history per user
- Created an API to handle the whole application
- Used Postman to test results
- Solving Trivy scanner and SonarQube errors
- Deployed the API using a Docker image, with Azure Git for version control
Task 2: supply chain management sector
- Explored azure document intelligence for extracting tables from complex excel sheets.
- Extracting information from vendor bids to understand availability of piping system material
or plant material according to our specifications.
- Analyzing vendor quotations and specifications to choose the best fit
- Recommending the best fit to decision making authority
Task 3: enterprise automation project
- Part of a project that generated responses for EOI documents.
- Embeddings created for multiple sector databases like oil and gas, chemical, construction,
etc.
- Workflow similar to task 1
- Application with multiple APIs
- Worked on creating a custom retriever to cache results for frequent queries, using a hybrid scoring method of weighted cosine similarity + keyword match
__________________________________________________________________________________
Task 1: AI-Powered Chatbot for Oil & Gas Sector
Designed and implemented an AI-driven chatbot capable of responding to user queries by
leveraging document-based knowledge stored in a centralized database.
Consolidated heterogeneous data sources including PDF files, Word documents, and Excel
sheets. Utilized LibreOffice for .doc to .pdf conversions and LangChain's unstructured
document loaders for Excel files to ensure standardized document formatting.
Processed all documents in PDF format using LangChain PDF loaders, enabling consistent
parsing and preprocessing.
Leveraged GPT-4o for extracting semantically rich content from PDFs and converting them
into LangChain document objects.
Created embeddings for all processed content using OpenAI’s text-embedding-ada-002
model and stored them in a high-performance vector database, enabling rapid semantic
search and retrieval.
Employed RecursiveCharacterTextSplitter to manage large document chunks effectively during embedding generation, optimizing retrieval accuracy (see the sketch after this task's bullet list).
Built a Retrieval-Augmented Generation (RAG) pipeline to provide contextual, accurate
answers based on stored document knowledge.
Integrated LangChain’s ConversationalRetrievalChain with buffer memory to maintain
contextual continuity in multi-turn conversations.
Developed a FastAPI-based backend API to orchestrate the end-to-end chatbot operations,
from query intake to response generation.
Validated API functionality and performance using Postman, ensuring reliability across
various input cases.
Addressed security and quality issues by resolving vulnerabilities flagged by Trivy (container
security) and SonarQube (static code analysis).
Containerized the application and deployed it using Docker, integrating with Azure Git for
version control and CI/CD readiness.
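A minimal sketch of the chunk-and-embed step. Import paths follow classic LangChain and may differ across versions; FAISS as the vector store and the chunk sizes are assumptions, since the writeup does not name them:
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import FAISS

# Chunk sizes are illustrative; tune for the documents and embedding model
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
chunks = splitter.split_documents(docs)  # docs: Document objects from the PDF loaders
embeddings = OpenAIEmbeddings(model="text-embedding-ada-002")
vectorstore = FAISS.from_documents(chunks, embeddings)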
Business Impact: Enhanced operational efficiency by providing engineers and field experts with real-
time access to domain-specific documentation and insights, significantly reducing time spent on
manual document search.
Task 2: Intelligent Document Analysis for Supply Chain Management
Explored and applied Azure Document Intelligence (formerly Form Recognizer) to extract
structured data, particularly complex tabular information from Excel-based vendor bids.
Automated the identification and extraction of key metrics such as material availability,
specifications, and compliance for piping systems and plant materials.
Conducted comparative analysis of vendor quotations using extracted insights to assist in
optimal supplier selection.
Delivered data-backed recommendations to procurement and decision-making teams,
enabling informed and faster purchasing decisions.
Business Impact: Streamlined vendor evaluation processes by automating bid analysis, leading to
reduced procurement cycle time and improved supply chain transparency.
Task 3: Enterprise Automation – Expression of Interest (EOI) Response Generation
Contributed to an enterprise-grade automation project aimed at generating responses for
EOI documents across various sectors, including Oil & Gas, Construction, and Chemicals.
Built sector-specific document embeddings using a pipeline similar to Task 1, enabling
intelligent matching and content generation for EOI requirements.
Developed and managed multiple APIs to handle modular tasks within the application such
as retrieval, ranking, and response generation.
Engineered a custom retriever capable of caching frequently asked queries, implementing a
hybrid retrieval approach that combined weighted cosine similarity with keyword matching
for improved ranking accuracy and reduced latency.
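A simplified sketch of the hybrid scoring plus caching idea. Function names, the 0.7/0.3 weight split, and the cache policy are illustrative assumptions, not the production implementation; embed() and documents are assumed to be defined elsewhere:
import numpy as np
from functools import lru_cache

def hybrid_score(q_vec, d_vec, q_terms, d_text, w_cos=0.7, w_kw=0.3):
    # Weighted cosine similarity + fraction of query keywords found in the doc
    cos = float(np.dot(q_vec, d_vec) / (np.linalg.norm(q_vec) * np.linalg.norm(d_vec)))
    kw = sum(t in d_text.lower() for t in q_terms) / max(len(q_terms), 1)
    return w_cos * cos + w_kw * kw

@lru_cache(maxsize=256)  # serves repeated (frequent) queries from cache
def retrieve(query: str, k: int = 5):
    q_vec = embed(query)  # assumed wrapper around the embedding model
    q_terms = tuple(query.lower().split())
    scored = [(hybrid_score(q_vec, d.vec, q_terms, d.text), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]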
Business Impact: Automated a previously manual and time-intensive process, increasing proposal
generation speed and enabling scalable client engagements across sectors.
RAG architecture:
- RAG has been described as "a general-purpose fine-tuning recipe" designed to integrate any large language model (LLM) with various internal or external knowledge sources.
- RAG gives an LLM a superpower: the ability to consult an external knowledge base before crafting its responses.
1. Question Input: The client inputs a question into the system. This initiates the process by
feeding the query into the framework.
2. Semantic Search: The framework employs semantic search techniques to query the vector
database. This search retrieves relevant contextual data based on the input question.
3. Contextual Data Utilization: The retrieved data is then used to create a prompt. This prompt
is specifically tailored to guide the LLM in generating a response that is both relevant and
informative.
4. Response Generation by LLM: The LLM processes the prompt and generates a response. The
LLM’s extensive training on vast datasets enables it to produce high-quality answers.
5. Post-Processing: The generated response undergoes post-processing to ensure clarity,
coherence, and appropriateness. This step may involve refining the language, correcting
errors, and enhancing the overall quality of the response.
6. Response Delivery: The final, polished response is delivered back to the client, providing
them with the information they sought in a clear and concise manner.
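A minimal sketch of this loop in Python. embed(), vector_db, and llm() are placeholders for the embedding model, vector store, and LLM client, not a specific library's API:
def answer(question: str) -> str:
    q_vec = embed(question)                      # 1. take the client's question, embed it
    hits = vector_db.search(q_vec, k=4)          # 2. semantic search over the vector DB
    context = "\n\n".join(h.text for h in hits)  # 3. build a context-grounded prompt
    prompt = (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    draft = llm(prompt)                          # 4. LLM generates a response
    return draft.strip()                         # 5. light post-processing, 6. deliver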
Use delta query to track changes
(Use delta query to track changes in Microsoft Graph data - Microsoft Graph | Microsoft Learn)
Hyperparameters in LLMs