0% found this document useful (0 votes)

48 views8 pages

Program 4

The document outlines a program that uses pre-trained GloVe word embeddings to enrich prompts for a Generative AI model by adding semantically similar words. It details the process of loading the GloVe model, defining a function to enrich prompts, and generating stories based on both original and enriched prompts. The program compares the outputs in terms of detail and relevance, demonstrating the effectiveness of enriched prompts in enhancing AI-generated content.

Uploaded by

Akash Y

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views8 pages

Program 4

Uploaded by

Akash Y

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Program 4:

Use word embeddings to improve prompts for Generative AI model. Retrieve similar words
using word embeddings. Use the similar words to enrich a GenAI prompt. Use the AI model to
generate responses for the original and enriched prompts. Compare the outputs in terms of
detail and relevance.

1. Importing Pre-trained Word Embeddings

import gensim.downloader as api

• gensim.downloader allows us to download pre-trained word embeddings from

gensim's model repository.
• These models provide pre-trained word vectors, so we don't have to train our own
Word2Vec model from scratch.

2. Loading the GloVe Model

model = api.load("glove-wiki-gigaword-50")

• This downloads and loads the GloVe (Global Vectors for Word Representation)
model trained on Wikipedia (glove-wiki-gigaword-50).
• The model contains word vectors of size 50 (50-dimensional vector representations
of words).
• Each word is mapped to a high-dimensional numerical representation, which helps
find semantic similarities.

3. Function Definition: enrich_prompt

def enrich_prompt(prompt, num_similar=3):

• This function takes a prompt (text input) and enriches it by adding similar words.
• num_similar=3: Specifies the number of similar words to add for each word in the
prompt.

4. Splitting the Prompt into Words

words = prompt.split()

• Splits the prompt (input sentence) into individual words.

5. Initialize an Empty List

enriched_words = []

• Creates an empty list enriched_words to store words along with their similar words.

6. Loop Through Each Word in the Prompt

for word in words:

• Iterates through each word in the prompt.

7. Finding Similar Words Using GloVe

try:
similar_words = [w for w, _ in model.most_similar(word,
topn=num_similar)]

• model.most_similar(word, topn=num_similar): Finds the num_similar (default

3) most similar words for the given word.
• It returns a list of tuples: (similar_word, similarity_score), but we only extract
similar_word.
• Example:

model.most_similar("cat", topn=3)

May return:

[('dog', 0.91), ('kitten', 0.85), ('feline', 0.83)]

Meaning "dog", "kitten", and "feline" are most similar to "cat".

enriched_words.append(word + " (" + ", ".join(similar_words) + ")")

• Formats the word by appending its similar words in parentheses.

• Example:

"cat" → "cat (dog, kitten, feline)"

• Appends this to the enriched_words list.

9. Handling Words Not Found in GloVe

except KeyError:
enriched_words.append(word)

• If the word is not found in the GloVe vocabulary, it remains unchanged.

• This avoids errors for uncommon words or typos.
10. Join Words Back Into a Sentence
return " ".join(enriched_words)

• Converts the list of enriched words back into a sentence.

11. Define an Original Prompt

original_prompt = "Write a story about a cat."

• This is the original input sentence.

12. Generate an Enriched Prompt

enriched_prompt = enrich_prompt(original_prompt)

• Calls the function enrich_prompt() with the input "Write a story about a
cat."
• Returns an enriched version of the prompt with similar words added.

13. Print the Results

print("Original Prompt:", original_prompt)
print("Enriched Prompt:", enriched_prompt)

• Prints both the original and enriched prompts.

Example Output
Original Prompt: Write a story about a cat.
Enriched Prompt: Write a (another, an, one) story (stories, book, tale)
about (than, there, more) a (another, an, one) cat.

• The function adds context to the prompt by suggesting words that are semantically
related.

Summary

• Loads pre-trained GloVe embeddings from gensim.

• Finds similar words for each word in the input prompt.
• Formats the enriched prompt by adding similar words in parentheses.
• Handles missing words gracefully.
• This technique can be used for prompt expansion, NLP applications, and creative
writing.

4a:

pip install gensim

import gensim.downloader as api

model = api.load("glove-wiki-gigaword-50")

def enrich_prompt(prompt, num_similar=3):

words = prompt.split()
enriched_words = []
for word in words:
try:
similar_words = [w for w, _ in model.most_similar(word, topn=num_similar)]
enriched_words.append(word + " (" + ", ".join(similar_words) + ")")
except KeyError:
enriched_words.append(word)
return " ".join(enriched_words)

original_prompt = "Write a story about a Dog."

enriched_prompt = enrich_prompt(original_prompt)

print("Original Prompt:", original_prompt)

print("Enriched Prompt:", enriched_prompt)
4b:
import gensim.downloader as api
import random

# Load the GloVe model (only needs to be done once)

try:
model = api.load("glove-wiki-gigaword-50")
except ValueError:
print("Downloading glove-wiki-gigaword-50 model...")
model = api.load("glove-wiki-gigaword-50")

def enrich_prompt(prompt, num_similar=3):

"""
Enriches a prompt by adding similar words to each word in the prompt.

Args:
prompt (str): The original prompt.
num_similar (int): The number of similar words to add.

Returns:
str: The enriched prompt.
"""
words = prompt.split()
enriched_words = []
for word in words:
try:
similar_words = [w for w, _ in model.most_similar(word, topn=num_similar)]
enriched_words.append(word + " (" + ", ".join(similar_words) + ")")
except KeyError:
enriched_words.append(word)
return " ".join(enriched_words)

def generate_story(prompt, length=100):

"""
Generates a simple story based on a prompt. This is a VERY basic
example and does not use any advanced language models. It's just
to illustrate the difference in story content.

Args:
prompt (str): The prompt to base the story on.
length (int): The approximate length of the story in words.

Returns:
str: A generated story.
"""

story = ""
words = prompt.split()
possible_next_words = words[:] # start with the words from the prompt
current_word = random.choice(words)
story += current_word + " "

for _ in range(length - 1):

next_word = random.choice(possible_next_words)
story += next_word + " "
possible_next_words.append(next_word) #add prev word
# Add some simple logic to make the story slightly more coherent.
if next_word in ["a", "an", "the"]:
possible_next_words.extend(words) # Boost words from the original prompt
if next_word in [".", "?", "!"]:
possible_next_words.extend(words[:]) # restart with key words from prompt
# add random meaningful words
meaningful_words = ["happily", "suddenly", "quietly", "jumped","ran", "slept","ate",
"thought","dreamed"]
possible_next_words.append(random.choice(meaningful_words))

return story + "."

# Example Usage:
original_prompt = "Write a story about a cat."
enriched_prompt = enrich_prompt(original_prompt)

print("Original Prompt:", original_prompt)

print("Enriched Prompt:", enriched_prompt)

original_story = generate_story(original_prompt)
enriched_story = generate_story(enriched_prompt)

print("\nOriginal Story:\n", original_story)

print("\nEnriched Story:\n", enriched_story)

# Compare the results

print("\nStory Lengths:")
print("Original Story:", len(original_story.split()))
print("Enriched Story:", len(enriched_story.split()))
print("\nOriginal Prompt Response Length:", len(original_prompt))
print("Enriched Prompt Response Length:", len(enriched_prompt))

inprotected.com

GAI4
No ratings yet
GAI4
2 pages
Generative AI Lab Manual
No ratings yet
Generative AI Lab Manual
24 pages
Genaii
No ratings yet
Genaii
5 pages
Lab
No ratings yet
Lab
8 pages
Enhance AI Prompts with Word Embeddings
No ratings yet
Enhance AI Prompts with Word Embeddings
2 pages
Gen AI PRG-5
No ratings yet
Gen AI PRG-5
4 pages
Import Gensim
No ratings yet
Import Gensim
8 pages
Gen AI Lab
No ratings yet
Gen AI Lab
22 pages
Gen AIL
No ratings yet
Gen AIL
12 pages
EWIT
No ratings yet
EWIT
21 pages
Model B
No ratings yet
Model B
3 pages
GenAI Shortened
No ratings yet
GenAI Shortened
8 pages
Gen AI Micro
No ratings yet
Gen AI Micro
15 pages
Word Embeddings & Similarity Analysis
No ratings yet
Word Embeddings & Similarity Analysis
12 pages
Generative AI 2
No ratings yet
Generative AI 2
24 pages
NLP Lab
No ratings yet
NLP Lab
18 pages
Ai&Ml Bai601 NLP Lab Manual
No ratings yet
Ai&Ml Bai601 NLP Lab Manual
48 pages
Word Generation in NLP with Bigram Model
No ratings yet
Word Generation in NLP with Bigram Model
2 pages
Generative AI
No ratings yet
Generative AI
16 pages
Genai Lab 1
No ratings yet
Genai Lab 1
6 pages
Gen AI Prog5
No ratings yet
Gen AI Prog5
2 pages
Genai
No ratings yet
Genai
17 pages
1st Programme
No ratings yet
1st Programme
16 pages
Https Raw - Githubusercontent.com Joelgrus Data-Science-From-Scratch Master Code Natural Language Processing
No ratings yet
Https Raw - Githubusercontent.com Joelgrus Data-Science-From-Scratch Master Code Natural Language Processing
5 pages
TSA Lab Manual New
No ratings yet
TSA Lab Manual New
14 pages
AI Prompts
No ratings yet
AI Prompts
2 pages
Batch 2
No ratings yet
Batch 2
13 pages
NLP Exp4
No ratings yet
NLP Exp4
10 pages
NLP Lab Codes Till Mod3
No ratings yet
NLP Lab Codes Till Mod3
7 pages
Gen AI VTUCircle
No ratings yet
Gen AI VTUCircle
1 page
Gen Ai Lab Programs
No ratings yet
Gen Ai Lab Programs
15 pages
Word Embedding Learning Process
No ratings yet
Word Embedding Learning Process
6 pages
NLP with Trigram and Bigram Models
No ratings yet
NLP with Trigram and Bigram Models
5 pages
Natural Language Processing Lab Manual
No ratings yet
Natural Language Processing Lab Manual
24 pages
Next Word Prediction With NLP and Deep Learning
No ratings yet
Next Word Prediction With NLP and Deep Learning
13 pages
Gen Ai Lab
No ratings yet
Gen Ai Lab
3 pages
NLP - (Natural Language Processing Lab Manual)
No ratings yet
NLP - (Natural Language Processing Lab Manual)
12 pages
Python Text Processing Techniques
No ratings yet
Python Text Processing Techniques
13 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
7 pages
Self Evaluation Exercises
No ratings yet
Self Evaluation Exercises
12 pages
UBC Summer School in NLP - VSP 2019 Lecture 9
No ratings yet
UBC Summer School in NLP - VSP 2019 Lecture 9
17 pages
NLP Manual Final
No ratings yet
NLP Manual Final
22 pages
Exp-2 NLP
No ratings yet
Exp-2 NLP
4 pages
NLP Lab Manual - Final
No ratings yet
NLP Lab Manual - Final
15 pages
Lab Manual - NLP
No ratings yet
Lab Manual - NLP
60 pages
NLP Exp2
No ratings yet
NLP Exp2
6 pages
Module 3 - NLP
No ratings yet
Module 3 - NLP
34 pages
Gen Ai-1
No ratings yet
Gen Ai-1
6 pages
French-English Seq2Seq Translation
No ratings yet
French-English Seq2Seq Translation
45 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
21 pages
Quadgram Language Model Analysis
No ratings yet
Quadgram Language Model Analysis
29 pages
Lab Manual
No ratings yet
Lab Manual
16 pages
Word2Vec for NLP Enthusiasts
No ratings yet
Word2Vec for NLP Enthusiasts
13 pages
R22 NLP Python Programs
No ratings yet
R22 NLP Python Programs
15 pages
AI Lab Programs
No ratings yet
AI Lab Programs
9 pages
NLP Assignment 2
No ratings yet
NLP Assignment 2
3 pages
123 NLP 456
No ratings yet
123 NLP 456
4 pages
HPE - c04423965 - RESTful Interface Tool User Guide
No ratings yet
HPE - c04423965 - RESTful Interface Tool User Guide
82 pages
EMTRAC System Initial Setup Guide
No ratings yet
EMTRAC System Initial Setup Guide
2 pages
MIUI MSA Global SDK Logs Analysis
No ratings yet
MIUI MSA Global SDK Logs Analysis
5 pages
Projectdiscovery - Katana - A Next-Generation Crawling and Spidering Framework
No ratings yet
Projectdiscovery - Katana - A Next-Generation Crawling and Spidering Framework
15 pages
Snowmen at Work
No ratings yet
Snowmen at Work
9 pages
15037440046
67% (3)
15037440046
2 pages
CS201 Cisco CPPE1 - Final Answers by ₦Ї₦ℑ₳
No ratings yet
CS201 Cisco CPPE1 - Final Answers by ₦Ї₦ℑ₳
34 pages
Access Modififers Salesforce
No ratings yet
Access Modififers Salesforce
4 pages
License Manager User Guide
No ratings yet
License Manager User Guide
27 pages
A & D Interoperability: Grasshopper: Parametric Design: Automating TSD Analysis & Design With Grasshopper
No ratings yet
A & D Interoperability: Grasshopper: Parametric Design: Automating TSD Analysis & Design With Grasshopper
50 pages
Fc0 U61 Comptia It Fundamentals Itf Certification 200 Exam Practice Questions With Detailed Explanations 1nbsped
No ratings yet
Fc0 U61 Comptia It Fundamentals Itf Certification 200 Exam Practice Questions With Detailed Explanations 1nbsped
176 pages
Critical Path Analysis Solved Example - MilestoneTask
80% (5)
Critical Path Analysis Solved Example - MilestoneTask
12 pages
Parosh CV
No ratings yet
Parosh CV
1 page
IEC Certification Kit: Embedded Coder™ Conformance Demonstration Template
No ratings yet
IEC Certification Kit: Embedded Coder™ Conformance Demonstration Template
22 pages
Introduction To Programming: Engineering Curriculum - 2023 JNTUK B.Tech. R23 Regulations
No ratings yet
Introduction To Programming: Engineering Curriculum - 2023 JNTUK B.Tech. R23 Regulations
2 pages
Introduction Data Management
No ratings yet
Introduction Data Management
12 pages
Data Structures and Algorithms in Python 1st Edition by Michael Goodrich, Roberto Tamassia, Michael Goldwasser ISBN 9781118476734 1118476735
100% (18)
Data Structures and Algorithms in Python 1st Edition by Michael Goodrich, Roberto Tamassia, Michael Goldwasser ISBN 9781118476734 1118476735
71 pages
Module 2 - Intro To HTML
No ratings yet
Module 2 - Intro To HTML
39 pages
SVMCM Manual
No ratings yet
SVMCM Manual
29 pages
Installation Guide - ArchWiki
No ratings yet
Installation Guide - ArchWiki
10 pages
B.Sc Data Analytics Excel Guide
No ratings yet
B.Sc Data Analytics Excel Guide
31 pages
HPE Reference Architecture For Digital Workspace On HPE Synergy Composable Infrastructure
No ratings yet
HPE Reference Architecture For Digital Workspace On HPE Synergy Composable Infrastructure
57 pages
JP CV
No ratings yet
JP CV
2 pages
Whole Team Approach to Agile Testing
No ratings yet
Whole Team Approach to Agile Testing
1 page
Customer Support & Telesales Expert
No ratings yet
Customer Support & Telesales Expert
1 page
Unit-4-Software Configuration Management
No ratings yet
Unit-4-Software Configuration Management
4 pages
Programming C# Extended Features: Hands-On: Course 973
No ratings yet
Programming C# Extended Features: Hands-On: Course 973
376 pages
Dharavi Slums Case Study
0% (1)
Dharavi Slums Case Study
13 pages
GOA - TCP - User Manual - Version - 3.2
No ratings yet
GOA - TCP - User Manual - Version - 3.2
163 pages

Program 4

Uploaded by

Program 4

Uploaded by

Program 4:

1. Importing Pre-trained Word Embeddings

• gensim.downloader allows us to download pre-trained word embeddings from

2. Loading the GloVe Model

3. Function Definition: enrich_prompt

4. Splitting the Prompt into Words

• Splits the prompt (input sentence) into individual words.

5. Initialize an Empty List

6. Loop Through Each Word in the Prompt

• Iterates through each word in the prompt.

7. Finding Similar Words Using GloVe

• model.most_similar(word, topn=num_similar): Finds the num_similar (default

[('dog', 0.91), ('kitten', 0.85), ('feline', 0.83)]

Meaning "dog", "kitten", and "feline" are most similar to "cat".

enriched_words.append(word + " (" + ", ".join(similar_words) + ")")

• Formats the word by appending its similar words in parentheses.

"cat" → "cat (dog, kitten, feline)"

• Appends this to the enriched_words list.

9. Handling Words Not Found in GloVe

• If the word is not found in the GloVe vocabulary, it remains unchanged.

• Converts the list of enriched words back into a sentence.

11. Define an Original Prompt

• This is the original input sentence.

12. Generate an Enriched Prompt

13. Print the Results

• Prints both the original and enriched prompts.

• Loads pre-trained GloVe embeddings from gensim.

pip install gensim

import gensim.downloader as api

def enrich_prompt(prompt, num_similar=3):

original_prompt = "Write a story about a Dog."

print("Original Prompt:", original_prompt)

# Load the GloVe model (only needs to be done once)

def enrich_prompt(prompt, num_similar=3):

def generate_story(prompt, length=100):

for _ in range(length - 1):

return story + "."

print("Original Prompt:", original_prompt)

print("\nOriginal Story:\n", original_story)

# Compare the results

You might also like