Project Plan For RAG Algorithm

Project CSEC aims to develop a model for retrieving project information based on natural language queries related to structural features and unique design elements. The project outlines four model versions, with the latest focusing on a hybrid approach that combines vector embeddings and GPT reasoning for improved accuracy. Next steps include fine-tuning the model, expanding the dataset, and making the model accessible through a user-friendly interface.

Uploaded by

Mohammed Salman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views6 pages

Project Plan For RAG Algorithm

Uploaded by

Mohammed Salman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Project CSEC

A Model to retrieve projects information’s (technical drawings/relevant documents) based on

natural language input — such as structural features or unique design elements

For example:

1. Structural Criteria: “List buildings with steel roofs and more than 2 stories.”

2. Unique Features: “Which projects have circular staircases?”

Extracted Project Dataset:

Model’s Data Structure:

Prototype Model’s

1. Model V1 – Text Similarity (Keyword Match)

- simple string-matching using Python & CSV

- Downsides: It only finds results if the query matches the

text exactly (no meaning or relationships understanding)
2. Model V2 – Vector Embedding Search (Semantic)
- use Sentence Transformers + FAISS (facebook ai similarity
check)
- Embeddings capture meaning but ignore structured
relationships
- Example: It might find “pile foundation” similar to “raft
foundation” just because they occur in similar contexts not
because they’re technically related.
- Can't enforce filters like: "only projects with 2+ stories
AND circular staircase AND pile foundation".
3. Model V3 – Hybrid (Embedding + GPT Reasoning)
- use vector embedding for narrowing candidates, then GPT-
4 for filtering & reasoning
- Operational Cost
- Need improvement for higher accuracy.
4. Model V4 – Hybrid model with domain specific fine
tunings
- Highly Accurate + Domain-Aware Responses

Next Steps

1. Fine-Tune the Model (for Higher Accuracy)

- Implementing model V4
Goal: Teach the model domain-specific knowledge using
examples (like project queries & expected outputs and their
relationships)

2. Improve with New Data + Set Up Relational DB

Goal: Expand and organize your data
(drawings/projects/features) to support scalable search &
retrieval.

3. Make the Model Accessible to Others (Web App or)

- Design UI
- Connect UI to backend
- Create a cloud/server DB and deploy the system

FYP Proposal
No ratings yet
FYP Proposal
18 pages
Pipeline For ML Problem
No ratings yet
Pipeline For ML Problem
13 pages
Projects For Ai
No ratings yet
Projects For Ai
8 pages
1905.13750 Sketch2code Generating A Website From A Paper
No ratings yet
1905.13750 Sketch2code Generating A Website From A Paper
64 pages
Machine Learning Guide
No ratings yet
Machine Learning Guide
10 pages
Improving Retrieval Augmented Generation
No ratings yet
Improving Retrieval Augmented Generation
33 pages
Prateek Gupta Resume
No ratings yet
Prateek Gupta Resume
3 pages
Career Dendrogram (Sem-7 - Minor - Project)
No ratings yet
Career Dendrogram (Sem-7 - Minor - Project)
22 pages
Best Project Ideas in Web Dev
No ratings yet
Best Project Ideas in Web Dev
11 pages
Bhavnesh Baghel's Resume
No ratings yet
Bhavnesh Baghel's Resume
2 pages
AI Powered Architecture Design 1
No ratings yet
AI Powered Architecture Design 1
11 pages
Master Thesis Mattias Wiberg Jonas Lauri
No ratings yet
Master Thesis Mattias Wiberg Jonas Lauri
75 pages
Sahil Garg Updated For Azure
No ratings yet
Sahil Garg Updated For Azure
8 pages
Project Ideas
No ratings yet
Project Ideas
2 pages
Floor Plan Generation Using GAN
100% (1)
Floor Plan Generation Using GAN
144 pages
Data Science & Engineering Project Ideas
No ratings yet
Data Science & Engineering Project Ideas
2 pages
Data Science Project List - Sheet1
No ratings yet
Data Science Project List - Sheet1
5 pages
Raviteja Kancharla: ML Engineer & Developer
No ratings yet
Raviteja Kancharla: ML Engineer & Developer
2 pages
Open Lab Report - Group 5
No ratings yet
Open Lab Report - Group 5
42 pages
Data Science Projects
No ratings yet
Data Science Projects
1 page
Report
No ratings yet
Report
36 pages
Deep Learning Projects
No ratings yet
Deep Learning Projects
13 pages
Large-Scale Auto-Regressive Modeling of Street Networks: Michael Birsak, Tom Kelly, Wamiq Para, Peter Wonka
No ratings yet
Large-Scale Auto-Regressive Modeling of Street Networks: Michael Birsak, Tom Kelly, Wamiq Para, Peter Wonka
12 pages
Generative Certification Notes-1
No ratings yet
Generative Certification Notes-1
22 pages
Project Ideas
No ratings yet
Project Ideas
5 pages
CV NguyenVanTuan
No ratings yet
CV NguyenVanTuan
3 pages
Essential Data Science Projects Guide
No ratings yet
Essential Data Science Projects Guide
1 page
RAG Model for Student Learning Aid
No ratings yet
RAG Model for Student Learning Aid
5 pages
D Caltech PG AI & ML Project
No ratings yet
D Caltech PG AI & ML Project
4 pages
ML Week 8
No ratings yet
ML Week 8
12 pages
Experiment 1
No ratings yet
Experiment 1
6 pages
Deep Learning Image Search Engine
No ratings yet
Deep Learning Image Search Engine
5 pages
CS F469 IR System Assignment
No ratings yet
CS F469 IR System Assignment
4 pages
Gen Ai Nash Phase 3
No ratings yet
Gen Ai Nash Phase 3
8 pages
Scalable Entity Resolution
No ratings yet
Scalable Entity Resolution
66 pages
Chat
No ratings yet
Chat
6 pages
Eth 48401 01
No ratings yet
Eth 48401 01
102 pages
Session 02 Practical Approach For AIML Projects
No ratings yet
Session 02 Practical Approach For AIML Projects
62 pages
10 1016@j Autcon 2010 06 007
No ratings yet
10 1016@j Autcon 2010 06 007
15 pages
Deep Image Search Project
No ratings yet
Deep Image Search Project
13 pages
2203a52154 Daup Report
No ratings yet
2203a52154 Daup Report
13 pages
IEEE Python & ML Projects 2019
No ratings yet
IEEE Python & ML Projects 2019
2 pages
Datascience
No ratings yet
Datascience
7 pages
AI Practical File Expanded
No ratings yet
AI Practical File Expanded
41 pages
Bachelor of Technology
No ratings yet
Bachelor of Technology
39 pages
Data Science Projects for Beginners
No ratings yet
Data Science Projects for Beginners
2 pages
Indradhanu Climate Literacy Ai Platform - Full Build Blueprint (MVP Finale)
No ratings yet
Indradhanu Climate Literacy Ai Platform - Full Build Blueprint (MVP Finale)
7 pages
Semi-Follower Robot Project Summary
No ratings yet
Semi-Follower Robot Project Summary
1 page
Projectacademy Artificial Intelligence Projects List 2023
No ratings yet
Projectacademy Artificial Intelligence Projects List 2023
8 pages
Bangkit 2021 Capstone Project Plan
No ratings yet
Bangkit 2021 Capstone Project Plan
5 pages
Resume 1734199998
No ratings yet
Resume 1734199998
1 page
Computer Vision Machine Learning Fundamental Algorithms Game Development
No ratings yet
Computer Vision Machine Learning Fundamental Algorithms Game Development
3 pages
Ślusarczyk Strug 2023 Machine Learning Methods in Bim Based Applications A Review
No ratings yet
Ślusarczyk Strug 2023 Machine Learning Methods in Bim Based Applications A Review
22 pages
Mursaleen Hassan Resume
No ratings yet
Mursaleen Hassan Resume
1 page
House Price Prediction
No ratings yet
House Price Prediction
55 pages
AI ML Python Content
No ratings yet
AI ML Python Content
4 pages
Top Tech Topics
No ratings yet
Top Tech Topics
10 pages

Project Plan For RAG Algorithm

Uploaded by

Project Plan For RAG Algorithm

Uploaded by

Project CSEC

A Model to retrieve projects information’s (technical drawings/relevant documents) based on

2. Unique Features: “Which projects have circular staircases?”

Extracted Project Dataset:

Model’s Data Structure:

1. Model V1 – Text Similarity (Keyword Match)

- Downsides: It only finds results if the query matches the

1. Fine-Tune the Model (for Higher Accuracy)

2. Improve with New Data + Set Up Relational DB

3. Make the Model Accessible to Others (Web App or)

You might also like