Seminar on
“User-Oriented Evaluation
Methods for Interactive
Web Search Interfaces”
Under the Guidance of: PROF. V. V. KONDHALKAR
Presented by: RAHUL P. GUPTA
AGENDA
INTRODUCTION
HISTORY
TYPES OF SEARCH ENGINE
WORKING
METHODS OF TEXT SEARCHING
RELEVANCE RANKING
META SEARCH ENGINE
CONCLUSION
INTRODUCTION
A Web search engine is a software program that
searches the Internet (a collection of websites) based
on the words that you designate as search terms
(query words).
Search engines look through their own databases
of information to find what you are
looking for.
Web search engines are a good example of
very large-scale information retrieval systems.
HISTORY
Archie – First search tool for the Internet
Gopher – indexed plain text documents
Jughead – searched the files stored in Gopher
index systems
Wandex – first Web search engine
TYPES OF SEARCH ENGINE
DIRECTORIES
Directories are staffed by human editors who consider every
new website submitted and, if they decide it is acceptable,
assign it to the appropriate category (YAHOO).
WEB CRAWLER
An automated Web browser which follows every link it sees.
(GOOGLE)
WORKING
A search engine operates in the following three steps:
1. Web crawling
2. Indexing
3. Searching
1. WEB CRAWLING
Web crawling is the process of scanning websites to add new pages
and to update existing ones.
A web spider is an automated system.
Googlebot is Google's web
crawling robot. It functions like a web
browser: it sends a request to a
web server for a web page,
downloads the entire page, then
hands it off to Google's indexer.
Spiders are always crawling.
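The request, download, and hand-off loop described above can be sketched in a few lines of Python. This is only a minimal illustration, not how Googlebot is actually implemented: the names `LinkExtractor`, `crawl`, `fetch`, and `FAKE_WEB` are assumptions made for this example, and an in-memory dictionary of made-up URLs stands in for real HTTP requests.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first crawl starting at seed_url.

    fetch(url) is a caller-supplied function (hypothetical here) that
    returns the page HTML, or None if the page cannot be retrieved.
    Returns a dict mapping each visited URL to its HTML.
    """
    queue = deque([seed_url])
    seen = {seed_url}
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages[url] = html  # the downloaded page would go to the indexer
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)  # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages


# A tiny in-memory "web" with made-up URLs, standing in for real servers.
FAKE_WEB = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a>',
    "http://example.test/b": '<a href="/">home</a>',
}

pages = crawl("http://example.test/", FAKE_WEB.get)
print(sorted(pages))
```

The `seen` set keeps the spider from requesting the same page twice, and `max_pages` bounds the crawl; a real crawler would also need politeness delays and robots.txt handling, which are left out here.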
2. INDEXING
Indexing allows information to be found as quickly as possible.
One of the most effective ways to build an index is to use a hash table.
A purely alphabetical index is uneven: for example, the "M" section of a dictionary is much thicker
than the "X" section, so some lookups take longer than others. Hashing evens out this difference.
Lycos indexes the title, headings, subheadings and the hyperlinks to
other sites, along with the first 20 lines of text and the 100 words that
occur most often.
Infoseek uses a full-text indexing system, picking up every word in
the text except commonly occurring stop words such as "a," "an,"
"the," "is," "and," "or," and "www."
AltaVista claims to index all words, even the articles, "a," "an," and "the."
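As a rough illustration of hash-table indexing with stop words, the sketch below builds an inverted index using a Python dictionary (which is a hash table). It does not reproduce the indexer of Lycos, Infoseek, or AltaVista; the function names `tokenize`, `build_index`, and `lookup` and the sample documents are assumptions made for this example.

```python
import re
from collections import defaultdict

# Stop words taken from the list quoted above.
STOP_WORDS = {"a", "an", "the", "is", "and", "or", "www"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents, stop_words=STOP_WORDS):
    """Map each non-stop word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in tokenize(text):
            if word not in stop_words:
                index[word].add(doc_id)
    return index


def lookup(index, query, stop_words=STOP_WORDS):
    """Return ids of documents that contain every non-stop query word."""
    words = [w for w in tokenize(query) if w not in stop_words]
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Made-up sample documents keyed by document id.
docs = {
    1: "The spider is crawling the web",
    2: "An index of the web",
    3: "Spiders and robots",
}
index = build_index(docs)
print(lookup(index, "the web"))
print(lookup(index, "spider web"))
```

Passing an empty set as `stop_words` gives the "index every word" behaviour that AltaVista is said to use, at the cost of a larger index.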
3. SEARCHING
METHODS OF TEXT SEARCHING
KEYWORD SEARCHING
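Most search engines perform text query and retrieval
using keywords.
Keyword search engines have trouble with so-called stemming, i.e. deciding whether a query word should also match its other forms (for example, whether "big" should also match "bigger").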
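The stemming problem can be made concrete with a small sketch. The suffix-stripping rule below is deliberately crude and is an assumption for illustration only; it is not the algorithm of any particular search engine or of an established stemmer, and the names `naive_stem` and `keyword_match` are hypothetical.

```python
SUFFIXES = ("ing", "ers", "er", "es", "ed", "s")


def naive_stem(word):
    """Strip one common English suffix; a crude stand-in for a real stemmer."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def keyword_match(query, text, stem=False):
    """True if every query word occurs in the text, optionally after stemming."""
    normalize = naive_stem if stem else str.lower
    text_words = {normalize(w) for w in text.split()}
    return all(normalize(w) in text_words for w in query.split())


# Exact keyword matching misses "Searching"; matching on stems finds it.
print(keyword_match("search", "Searching the web"))
print(keyword_match("search", "Searching the web", stem=True))
```

Even this toy rule shows why stemming is hard: it maps "crawlers" and "crawling" to the same stem, but it turns "bigger" into "bigg", which does not match "big".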
CONCEPT SEARCHING (CLUSTERING)
Concept-based search systems try to determine what
you mean, not just what you say.
Excite is currently the best-known general-purpose
search engine site on the Web that relies on
concept-based searching.
RELEVANCE RANKING
Term Frequency
Locations of Terms
Link Analysis
Popularity
Date of Publication
Proximity of Query Terms
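Two of the factors listed above, term frequency and location of terms, can be combined into a toy scoring function. This is only a sketch under stated assumptions: the names `score` and `rank`, the `title_weight` value, and the sample documents are made up for illustration, and real engines combine many more signals (link analysis, popularity, date, proximity) with weights that are not covered here.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, title, body, title_weight=3.0):
    """Toy relevance score: term frequency in the body plus a title bonus.

    title_weight is an arbitrary illustrative value, not a tuned parameter.
    """
    title_words = tokenize(title)
    body_words = tokenize(body)
    total = 0.0
    for term in tokenize(query):
        if body_words:
            # Term frequency: share of body words equal to the term.
            total += body_words.count(term) / len(body_words)
        if term in title_words:
            # Location of terms: a hit in the title counts for more.
            total += title_weight
    return total


def rank(query, documents):
    """Return document ids sorted from most to least relevant."""
    return sorted(
        documents,
        key=lambda doc_id: score(query, *documents[doc_id]),
        reverse=True,
    )


# Made-up documents: id -> (title, body).
docs = {
    "a": ("Cooking tips", "search engines are mentioned once here"),
    "b": ("Search engines", "how search engines rank pages"),
    "c": ("Gardening", "plants and soil"),
}
print(rank("search engines", docs))
```

Document "b" ranks first because both query terms appear in its title, "a" follows on body term frequency alone, and "c" scores zero.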
META SEARCH ENGINE
A meta-search engine is a search tool that sends user
requests to several other search engines and/or databases
and aggregates the results into a single list or displays them
according to their source.
Meta-search engines do not
own a database of Web pages.
E.g., DOGPILE
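The "send to several engines and aggregate into a single list" idea can be sketched as follows. The merge rule (each URL earns 1/rank points from every engine that returns it) is an assumption chosen for simplicity, not the method Dogpile or any other meta-search engine is known to use, and `engine_one` and `engine_two` are hypothetical stand-ins for real engines.

```python
def meta_search(query, engines):
    """Send the query to every engine and merge the results into one list.

    engines maps an engine name to a function: query -> ordered list of URLs.
    Each URL earns 1/rank points per engine that returns it, duplicates are
    collapsed, and the source engines are kept alongside each URL.
    """
    scores = {}
    sources = {}
    for name, search in engines.items():
        for position, url in enumerate(search(query), start=1):
            scores[url] = scores.get(url, 0.0) + 1.0 / position
            sources.setdefault(url, []).append(name)
    # Highest score first; ties broken alphabetically by URL.
    merged = sorted(scores, key=lambda url: (-scores[url], url))
    return [(url, sources[url]) for url in merged]


# Hypothetical engines returning fixed results; a real meta-search engine
# would forward the query over the network instead.
def engine_one(query):
    return ["http://a.test", "http://b.test", "http://c.test"]


def engine_two(query):
    return ["http://b.test", "http://d.test"]


results = meta_search("web crawler", {"one": engine_one, "two": engine_two})
for url, found_by in results:
    print(url, found_by)
```

Keeping the list of source engines for each URL supports both presentations mentioned above: a single merged list, or results grouped by the engine that returned them.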
Examples of Different Search Engines
SEARCH                 META-SEARCH            DIRECTORY
www.4websearch.com     www.Ixquick.com        www.yahoo.com
www.altavista.com      www.mamma.com          www.about.com
www.alltheweb.com      www.metacrawler.com    www.galaxy.com
www.google.com         www.redesearch.com     www.goguides.org
www.hotbot.com         www.surfwax.com        www.looksmart.com
www.lycos.com          www.turbo10.com        www.zeal.com
CONCLUSION
Although many search engines are available on the
web, both the search methods and the engines themselves still have a
long way to go before they retrieve information on relevant
topics efficiently.
No search engine available today is perfect,
but using the right one at the right time can make all the
difference.
Use meta-search engines: they can reduce your search effort
to a great extent. The good news is that search
engines are evolving every day to improve retrieval
efficiency.