0% found this document useful (0 votes)

34 views4 pages

IR Unit I Notes

The document outlines the characteristics of the World Wide Web, highlighting its global accessibility, interconnectedness, and support for multimedia and user interactivity. It discusses the transformative impact of the web on information retrieval, emphasizing global availability, search engines, and personalized retrieval while also addressing challenges like information overload and credibility concerns. Additionally, it differentiates between web search and information retrieval (IR), noting their distinct scopes, technologies, and user interactions.

Uploaded by

mohamedfarookali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views4 pages

IR Unit I Notes

Uploaded by

mohamedfarookali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Characteristics of web

The web (World Wide Web) is a vast, interconnected system for accessing and sharing
information over the internet. Here are some key characteristics:
1. Global Accessibility
 The web is accessible to anyone with an internet connection, enabling worldwide communication
and information exchange.
2. Interconnectedness
 Hyperlinks allow users to navigate seamlessly between related documents or resources across
different domains.
3. Dynamism
 The web includes both static content (unchanging) and dynamic content (customized and
updated based on user interaction or time).
4. Multimedia Support
 Supports various types of content, including text, images, videos, audio, and interactive
elements.
5. User Interactivity
 Modern web applications support complex user interactions, such as filling out forms, chatting,
and online shopping.
6. Scalability
 The web is designed to scale from small personal sites to large platforms serving millions of
users.
7. Decentralization
 No single entity controls the entire web; it is distributed and relies on various servers and
systems across the globe.
8. Cross-Platform Compatibility
 The web can be accessed from various devices, including computers, smartphones, tablets, and
IoT devices, through web browsers.
9. Open Standards
 It is built on open standards like HTML, CSS, and JavaScript, governed by organizations like the
W3C (World Wide Web Consortium).
10. Searchability
 Web content is indexed by search engines, making it easy to locate information through
keywords.
11. Evolutionary Nature
 The web continuously evolves, incorporating new technologies (e.g., Web 2.0, Web 3.0) and
practices.
12. Security
 Encryption protocols (like HTTPS) and authentication methods protect sensitive data and user
privacy.
13. Ubiquity
 The web integrates with various aspects of daily life, including education, business,
entertainment, and social networking.
Impact of web in information retrieval
The impact of the web on information retrieval has been transformative, fundamentally changing
how individuals and organizations access, process, and utilize information. Here are the key
ways the web has influenced information retrieval:
1. Global Availability of Information
 The web provides instant access to a vast repository of information on virtually any topic,
breaking down geographical and institutional barriers.
2. Search Engines as Gateways
 Search engines like Google, Bing, and DuckDuckGo have revolutionized information retrieval
by indexing web content and enabling keyword-based searches, making it quick and easy to find
relevant data.
3. Improved Accessibility
 The web ensures that diverse audiences, including those with disabilities, can retrieve
information using assistive technologies like screen readers and voice search.
4. Real-Time Updates
 Dynamic web platforms deliver real-time information, such as breaking news, stock market
updates, or weather forecasts, enabling timely decision-making.
5. Cost Reduction
 Free access to vast amounts of information reduces costs associated with traditional information
retrieval methods, such as purchasing books or journal subscriptions.
6. Personalized Information Retrieval
 Advanced algorithms analyze user behavior and preferences to deliver personalized search
results and recommendations, improving relevance and efficiency.
7. Interactivity and Collaboration
 Web 2.0 platforms enable collaborative information retrieval through forums, wikis, and social
media, allowing users to share insights and build knowledge collectively.
8. Multimedia Integration
 The web facilitates retrieval of various formats (text, audio, video, and images), accommodating
diverse learning and consumption preferences.
9. Scalability
 The web supports retrieval from small, niche databases to massive global datasets, catering to
both specific and general needs.
10. Democratization of Information
 The web empowers individuals by making high-quality information freely accessible, leveling
the playing field for education and innovation.
11. Challenges in Information Overload
 The sheer volume of web content can overwhelm users, making tools like advanced search
filters, metadata, and artificial intelligence crucial for effective retrieval.
12. Credibility Concerns
 While the web makes information widely available, it also requires users to critically evaluate
sources due to the prevalence of misinformation and biased content.
13. Data-Driven Decision Making
 The web has enabled organizations to mine data for trends and insights, enhancing business
intelligence and research methodologies.
Web Search vs. Information Retrieval (IR)

While "web search" and "information retrieval" (IR) are closely related concepts, they differ in
scope, focus, and application. Here's a comparative breakdown:

1. Definition

 Web Search:
o The process of using a search engine to find information on the web.
o Focuses on retrieving relevant web pages or resources based on user queries.
 Information Retrieval (IR):
o A broader field encompassing techniques and systems for finding relevant
information in any repository (e.g., databases, documents, or web content).
o Deals with indexing, storing, and retrieving information from structured and
unstructured data sources.

2. Scope

 Web Search:
o Limited to the web and uses search engines like Google or Bing.
o Mainly concerned with retrieving URLs, multimedia, or other web resources.
 IR:
o Can occur across various mediums, including digital libraries, file systems,
databases, or offline repositories.
o Includes advanced methods for structured retrieval (e.g., SQL queries) and
unstructured data analysis (e.g., semantic search).

3. Underlying Technology

 Web Search:
o Heavily relies on search engine algorithms (e.g., PageRank).
o Incorporates crawling, indexing, ranking, and user personalization based on
browsing behavior.
 IR:
o Utilizes broader methodologies, such as:
 Vector Space Models
 Natural Language Processing (NLP)
 Latent Semantic Analysis (LSA)
o IR systems may not necessarily use a web-based infrastructure.

4. User Interaction

 Web Search:
o User-focused and designed for simplicity and speed.
o Prioritizes ease of use, often employing simple keyword searches and
autocomplete suggestions.
 IR:
o Designed for researchers or technical users in some cases.
o May involve advanced query formulations, filters, or technical knowledge to
retrieve highly specific results.
5. Data Characteristics

 Web Search:
o Primarily targets unstructured data (HTML, blogs, social media posts, etc.).
o Deals with data inconsistency, redundancy, and credibility challenges.
 IR:
o Works with both structured (e.g., databases, spreadsheets) and unstructured
(e.g., text documents, multimedia) data.
o Focuses on metadata, tagging, and context-based analysis.

6. Examples

 Web Search:
o Typing "best smartphones 2024" into Google to retrieve top-ranked articles,
reviews, or online stores.
 IR:
o Retrieving relevant research papers on "machine learning models" from an
academic database like PubMed or IEEE Xplore.

7. Evaluation Metrics

 Web Search:
o Evaluated by user engagement metrics:
 Click-through rate (CTR)
 Bounce rate
 Time on page
o Success is based on the relevance and ranking of web pages.
 IR:
o Evaluated using precision, recall, F1-score, and Mean Average Precision (MAP).
o Focuses on accuracy and efficiency of retrieval.

8. Applications

 Web Search:
o Designed for the general public to find web-based resources.
o Commonly used in marketing, e-commerce, and day-to-day information
gathering.
 IR:
o Used in specialized domains like enterprise search (internal company data),
digital libraries, or medical diagnosis systems.

Summary Table

Aspect Web Search Information Retrieval (IR)

Scope Limited to the web Broader, covers all data repositories
User General public Researchers, technical users
Data Types Primarily unstructured web content Structured and unstructured data
Technology Search engine-specific (e.g., PageRank) Broad IR techniques (e.g., NLP, LSA)
Examples Google, Bing PubMed, enterprise search systems

Cs8080irtunitinotes 220515215754 E06d144b
No ratings yet
Cs8080irtunitinotes 220515215754 E06d144b
43 pages
Information Retrieval Course
No ratings yet
Information Retrieval Course
24 pages
Information Retrieval Techniques Overview
No ratings yet
Information Retrieval Techniques Overview
281 pages
INFS 427 - Session 08 - Online Information Retrieval Systems
No ratings yet
INFS 427 - Session 08 - Online Information Retrieval Systems
21 pages
Information Retrieval & Machine Learning: Supporting Technologies For Web Mining Research & Practice
No ratings yet
Information Retrieval & Machine Learning: Supporting Technologies For Web Mining Research & Practice
16 pages
UNIT I - Introduction and Motivation
No ratings yet
UNIT I - Introduction and Motivation
57 pages
Semantc Web and Social Networks
No ratings yet
Semantc Web and Social Networks
63 pages
1 Mod-1 - Lec-1
No ratings yet
1 Mod-1 - Lec-1
21 pages
Cs6007 Information Retrieval Question Bank
100% (3)
Cs6007 Information Retrieval Question Bank
45 pages
Ir Mod1 Notes
No ratings yet
Ir Mod1 Notes
20 pages
Irs Unit-5
No ratings yet
Irs Unit-5
28 pages
Web Mining and Search Engine Challenges
No ratings yet
Web Mining and Search Engine Challenges
50 pages
DWDM Unit 4
No ratings yet
DWDM Unit 4
11 pages
Web Development Strategies and History
No ratings yet
Web Development Strategies and History
28 pages
CS8080 Irt Q&a
No ratings yet
CS8080 Irt Q&a
54 pages
Understanding the World Wide Web
No ratings yet
Understanding the World Wide Web
5 pages
It Era-Week4-1
No ratings yet
It Era-Week4-1
13 pages
CS 327 - Lecture 1
No ratings yet
CS 327 - Lecture 1
37 pages
Living With IT ERA
No ratings yet
Living With IT ERA
17 pages
IR Workbook Answers
No ratings yet
IR Workbook Answers
36 pages
Sma Unit 1
No ratings yet
Sma Unit 1
14 pages
Evolution of the World Wide Web
No ratings yet
Evolution of the World Wide Web
9 pages
Lesson 1-ETECH - Intro To ICT
No ratings yet
Lesson 1-ETECH - Intro To ICT
45 pages
L1 Emtech Q3
No ratings yet
L1 Emtech Q3
55 pages
Unit 5
No ratings yet
Unit 5
20 pages
Understanding Web Evolution and Internet Basics
No ratings yet
Understanding Web Evolution and Internet Basics
7 pages
Ite 5 8
No ratings yet
Ite 5 8
7 pages
Introduction To IR 2021
No ratings yet
Introduction To IR 2021
40 pages
ICT Innovations in Pandemic Response
No ratings yet
ICT Innovations in Pandemic Response
9 pages
Wad Module3
No ratings yet
Wad Module3
38 pages
Module 1
No ratings yet
Module 1
53 pages
Understanding ICT in Daily Life
No ratings yet
Understanding ICT in Daily Life
18 pages
q1 Lesson 1 Part 1
No ratings yet
q1 Lesson 1 Part 1
26 pages
ICT Evolution and Trends
No ratings yet
ICT Evolution and Trends
46 pages
ICT Concepts and Web Evolution
No ratings yet
ICT Concepts and Web Evolution
49 pages
Module 12025
No ratings yet
Module 12025
9 pages
Introduction to ICT and Web Evolution
No ratings yet
Introduction to ICT and Web Evolution
5 pages
World Wide Web
No ratings yet
World Wide Web
43 pages
World Wide Web in Detail
No ratings yet
World Wide Web in Detail
4 pages
Lesson1 Introduction To ICT
No ratings yet
Lesson1 Introduction To ICT
34 pages
ICT Trends and Web Evolution
No ratings yet
ICT Trends and Web Evolution
5 pages
ICT Trends for Students
No ratings yet
ICT Trends for Students
22 pages
Emp Tech (Quarter 1, Module 1)
No ratings yet
Emp Tech (Quarter 1, Module 1)
4 pages
EMPO TECH Notes
No ratings yet
EMPO TECH Notes
3 pages
EMPOWERMENT TECHNOLOGY Reviewer
100% (1)
EMPOWERMENT TECHNOLOGY Reviewer
5 pages
Module 3 - IT
No ratings yet
Module 3 - IT
6 pages
Lesson 2 Types of Web
No ratings yet
Lesson 2 Types of Web
27 pages
ICT Basics for High School Students
No ratings yet
ICT Basics for High School Students
63 pages
E-Tech Course Guide for Students
No ratings yet
E-Tech Course Guide for Students
32 pages
Ict Reviewer
No ratings yet
Ict Reviewer
5 pages
Unit-5. Search Engines
No ratings yet
Unit-5. Search Engines
105 pages
Data Mining & Web Analytics Guide
No ratings yet
Data Mining & Web Analytics Guide
21 pages
(L1) Introduction To ICT
No ratings yet
(L1) Introduction To ICT
17 pages
Unit5 Irt
No ratings yet
Unit5 Irt
10 pages
World Wide Web
No ratings yet
World Wide Web
58 pages
Web 3.0
No ratings yet
Web 3.0
4 pages
Lecture Grade 12
No ratings yet
Lecture Grade 12
2 pages
iDS-7216HQHI-M2 Data Sheet
No ratings yet
iDS-7216HQHI-M2 Data Sheet
6 pages
TRI-NIT Hackathon Problem Statements 2023
0% (1)
TRI-NIT Hackathon Problem Statements 2023
33 pages
Useful Download (/downloads) : Contact Us
No ratings yet
Useful Download (/downloads) : Contact Us
2 pages
61 Essential ASP - NET Core Interview Questions and Answers
No ratings yet
61 Essential ASP - NET Core Interview Questions and Answers
11 pages
Avaya IP Office 10.1 Telmex SIP Setup
No ratings yet
Avaya IP Office 10.1 Telmex SIP Setup
37 pages
Laboratory3-ITT557-2020878252-SITI FARHANA
No ratings yet
Laboratory3-ITT557-2020878252-SITI FARHANA
16 pages
Mccann - 2 22 19
No ratings yet
Mccann - 2 22 19
1 page
AWS Data Engineer Kindle Guide
No ratings yet
AWS Data Engineer Kindle Guide
16 pages
Devesh Kumar's Online Disappearance
No ratings yet
Devesh Kumar's Online Disappearance
6 pages
Crypto Currency Tracker PBL Report
No ratings yet
Crypto Currency Tracker PBL Report
13 pages
Email and Password Recovery
No ratings yet
Email and Password Recovery
20 pages
5th Grade Water Cycle Activities
No ratings yet
5th Grade Water Cycle Activities
7 pages
SaaS HCM Non-Functional Requirements
No ratings yet
SaaS HCM Non-Functional Requirements
34 pages
Google Sites Tutorial
No ratings yet
Google Sites Tutorial
12 pages
Quality Management in Industry 4.0
No ratings yet
Quality Management in Industry 4.0
10 pages
UK Standard CDR Format v3
No ratings yet
UK Standard CDR Format v3
15 pages
Aniket Patil 8237979061
No ratings yet
Aniket Patil 8237979061
1 page
Answers PHD
No ratings yet
Answers PHD
1 page
100th Day of School Free Activities
No ratings yet
100th Day of School Free Activities
20 pages
Collins Aerospace Account Setup Guide
No ratings yet
Collins Aerospace Account Setup Guide
6 pages
Putting References On Resume
100% (2)
Putting References On Resume
7 pages
Telus Notes
No ratings yet
Telus Notes
47 pages
Axis m3014 User Manual en US 96908
No ratings yet
Axis m3014 User Manual en US 96908
52 pages
Ite6101 Long Quiz 3
No ratings yet
Ite6101 Long Quiz 3
8 pages
Lesson 4 Multimedia Formats
No ratings yet
Lesson 4 Multimedia Formats
17 pages
Digital Library - Definition, Scope, and Characteristics
100% (2)
Digital Library - Definition, Scope, and Characteristics
7 pages
MFC Tutorial PDF
100% (1)
MFC Tutorial PDF
6,778 pages
EXOdesigner PRSH en
No ratings yet
EXOdesigner PRSH en
4 pages
Simplified Deadlock Detection Models
No ratings yet
Simplified Deadlock Detection Models
4 pages
Project Series-5MP-I-HIPB5PI-MF
100% (1)
Project Series-5MP-I-HIPB5PI-MF
3 pages

IR Unit I Notes

Uploaded by

IR Unit I Notes

Uploaded by

Characteristics of web

Aspect Web Search Information Retrieval (IR)

You might also like