SlideShare a Scribd company logo
Web search engine From Wikipedia, the free encyclopedia
Search engine" redirects here. For other uses, see  Search engine (disambiguation) . A  web search engine  is designed to search for information on the  World Wide Web . The search results are usually presented in a list of results and are commonly called  hits . The information may consist of  web pages , images, information and other types of files. Some search engines also  mine data  available in  databases  or  open directories . Unlike  Web directories , which are maintained by human editors, search engines operate  algorithmically  or are a mixture of algorithmic and human input.
History During the early development of the web, there was a list of  webservers  edited by  Tim Berners-Lee  and hosted on the  CERN  webserver. One historical snapshot from 1992 remains. [1]  As more webservers went online the central list could not keep up. On the  NCSA  site new servers were announced under the title "What's New!"
The very first tool used for searching on the Internet was  Archie . [3]  The name stands for "archive" without the "v." It was created in 1990 by  Alan  Emtage , a student at  McGill University  in  Montreal . The program downloaded the directory listings of all the files located on public anonymous FTP ( File Transfer Protocol ) sites, creating a searchable database of file names; however, Archie did not index the contents of these sites.
The rise of  Gopher  (created in 1991 by  Mark  McCahill  at the  University of Minnesota ) led to two new search programs,  Veronica  and  Jughead . Like Archie, they searched the file names and titles stored in Gopher index systems. Veronica ( V ery  E asy  R odent- O riented  N et-wide  I ndex to  C omputerized  A rchives) provided a keyword search of most Gopher menu titles in the entire Gopher listings. Jughead ( J onzy's  U niversal  G opher  H ierarchy  E xcavation  A nd  D isplay) was a tool for obtaining menu information from specific Gopher servers. While the name of the search engine "Archie" was not a reference to the  Archie comic book  series, " Veronica " and " Jughead " are characters in the series, thus referencing their predecessor.
In the summer of 1993, no search engine existed yet for the web, though numerous specialized catalogues were maintained by hand.  Oscar  Nierstrasz  at the  University of Geneva  wrote a series of  Perl  scripts that would periodically mirror these pages and rewrite them into a standard format which formed the basis for  W3Catalog , the web's first primitive search engine, released on September 2, 1993 [4] .
In June 1993, Matthew Gray, then at  MIT , produced what was probably the first  web robot , the  Perl -based  World Wide Web Wanderer , and used it to generate an index called 'Wandex'. The purpose of the Wanderer was to measure the size of the World Wide Web, which it did until late 1995. The web's second search engine  Aliweb  appeared in November 1993. Aliweb did not use a  web robot , but instead depended on being notified by website administrators of the existence at each site of an index file in a particular format.
JumpStation  (released in December 1993 [5] ) used a  web robot  to find web pages and to build its index, and used a  web form  as the interface to its query program. It was thus the first WWW resource-discovery tool to combine the three essential features of a web search engine (crawling, indexing, and searching) as described below. Because of the limited resources available on the platform on which it ran, its indexing and hence searching were limited to the titles and headings found in the web pages the crawler encountered.
One of the first "full text" crawler-based search engines was  WebCrawler , which came out in 1994. Unlike its predecessors, it let users search for any word in any webpage, which has become the standard for all major search engines since. It was also the first one to be widely known by the public. Also in 1994,  Lycos  (which started at  Carnegie Mellon University ) was launched and became a major commercial endeavor.
Soon after, many search engines appeared and vied for popularity. These included  Magellan ,  Excite ,  Infoseek ,  Inktomi ,  Northern Light , and  AltaVista .  Yahoo!  was among the most popular ways for people to find web pages of interest, but its search function operated on its  web directory , rather than full-text copies of web pages. Information seekers could also browse the directory instead of doing a keyword-based search.

More Related Content

PPTX
Linked data and rdf
PPTX
Search Engine
PDF
Introduction to Research Objects - Collaboartions Workshop 2015, Oxford
PDF
Web of Data Usage Mining
PPTX
Evolutionary & Swarm Computing for the Semantic Web
PDF
agINFRA work on germplasm and soil Linked Data by Luca Matteus, Giovanni L’Ab...
PPT
Apache Stanbol 
and the Web of Data - ApacheCon 2011
PPT
Exploring and using the Semantic Web - SSSW09 tutorial
Linked data and rdf
Search Engine
Introduction to Research Objects - Collaboartions Workshop 2015, Oxford
Web of Data Usage Mining
Evolutionary & Swarm Computing for the Semantic Web
agINFRA work on germplasm and soil Linked Data by Luca Matteus, Giovanni L’Ab...
Apache Stanbol 
and the Web of Data - ApacheCon 2011
Exploring and using the Semantic Web - SSSW09 tutorial

What's hot (20)

PPTX
OpenRefine reconciliation services
PPT
Using OpenURL Activity Data - Activity Data Online Exchange Event
PPTX
Search engines
PPTX
Stanbol
KEY
Transmission6 - Publishing Linked Data
PPTX
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
PPTX
Linked data 101: Getting Caught in the Semantic Web
PPTX
FAIR Projector Builder
PPTX
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
PPT
Locah Project Show and Tell
KEY
It's not rocket surgery - Linked In: ALA 2011
PDF
Statster introduction essay
PPTX
Alternative searchengines ocallaghan
PPT
Linked Data Overview - AGI Technical SIG
PDF
Open Bibliography, Citations and Scholarship
PDF
Keynote session - LOD2014 W3C event
PDF
Linked Data and Archival Description: Confluences, Contingencies, and Conflicts
PDF
Archives & the Semantic Web
ODP
Commodity Semantic Search: A Case Study of DiscoverEd
PDF
Publishing and Using Linked Data
OpenRefine reconciliation services
Using OpenURL Activity Data - Activity Data Online Exchange Event
Search engines
Stanbol
Transmission6 - Publishing Linked Data
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Linked data 101: Getting Caught in the Semantic Web
FAIR Projector Builder
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
Locah Project Show and Tell
It's not rocket surgery - Linked In: ALA 2011
Statster introduction essay
Alternative searchengines ocallaghan
Linked Data Overview - AGI Technical SIG
Open Bibliography, Citations and Scholarship
Keynote session - LOD2014 W3C event
Linked Data and Archival Description: Confluences, Contingencies, and Conflicts
Archives & the Semantic Web
Commodity Semantic Search: A Case Study of DiscoverEd
Publishing and Using Linked Data
Ad

Viewers also liked (20)

PPSX
Mexico by john2
PPTX
13강 기업교육론 20110603
PPS
ομορσιες σος καναδα
PPS
Pamukkale gr
PPTX
Game accessibility and the aging community
PPS
Reuters2
PPTX
Прориси
PDF
El Plagio
PPT
Collective action under autocracies
PPT
10강 기업교육론 20110504
PPS
Luciano pavarotti chitarra romana
PDF
Indemnification to Nominee Director- Allowed or not-CA 2013
PDF
Cost Reductions Process - Results
PPS
The bestof
PPT
Internet messesage
PDF
Modeling Social Data, Lecture 8: Recommendation Systems
PDF
Visualizing geolocated Internet measurements
PPTX
Madrid `in tanitimi
PPS
From the air
Mexico by john2
13강 기업교육론 20110603
ομορσιες σος καναδα
Pamukkale gr
Game accessibility and the aging community
Reuters2
Прориси
El Plagio
Collective action under autocracies
10강 기업교육론 20110504
Luciano pavarotti chitarra romana
Indemnification to Nominee Director- Allowed or not-CA 2013
Cost Reductions Process - Results
The bestof
Internet messesage
Modeling Social Data, Lecture 8: Recommendation Systems
Visualizing geolocated Internet measurements
Madrid `in tanitimi
From the air
Ad

Similar to Web search engine (20)

DOC
Search Engine
PPT
Web search engines ( Mr.Mirza )
PDF
Search Engine Optimization Virtuo.Ca
PPT
Early History of Search Engine by Viterbi School
PPTX
world wide web and its components and the related aspects
PPTX
WWW and description of the web and the internet
PDF
PPTX
Search Engine Made By Hasnain jatt .pptx
PPT
The development of web archiving 3
PDF
World Wide Web - wikipedia.pdf. in Living in IT Era
PPSX
Know The Future of IT - WEB
PPTX
Webcrawler
PPTX
PPT
Introduction To Search - SEO 101
PPTX
Search engines
PDF
A History Of First Search Engine S
PPT
pranav,sahil and shriman presents search engine
PPTX
How search engine works and history of search engine
DOC
Done reread detecting phrase-level duplication on the world wide we
Search Engine
Web search engines ( Mr.Mirza )
Search Engine Optimization Virtuo.Ca
Early History of Search Engine by Viterbi School
world wide web and its components and the related aspects
WWW and description of the web and the internet
Search Engine Made By Hasnain jatt .pptx
The development of web archiving 3
World Wide Web - wikipedia.pdf. in Living in IT Era
Know The Future of IT - WEB
Webcrawler
Introduction To Search - SEO 101
Search engines
A History Of First Search Engine S
pranav,sahil and shriman presents search engine
How search engine works and history of search engine
Done reread detecting phrase-level duplication on the world wide we

Web search engine

  • 1. Web search engine From Wikipedia, the free encyclopedia
  • 2. Search engine" redirects here. For other uses, see Search engine (disambiguation) . A web search engine is designed to search for information on the World Wide Web . The search results are usually presented in a list of results and are commonly called hits . The information may consist of web pages , images, information and other types of files. Some search engines also mine data available in databases or open directories . Unlike Web directories , which are maintained by human editors, search engines operate algorithmically or are a mixture of algorithmic and human input.
  • 3. History During the early development of the web, there was a list of webservers edited by Tim Berners-Lee and hosted on the CERN webserver. One historical snapshot from 1992 remains. [1] As more webservers went online the central list could not keep up. On the NCSA site new servers were announced under the title "What's New!"
  • 4. The very first tool used for searching on the Internet was Archie . [3] The name stands for "archive" without the "v." It was created in 1990 by Alan Emtage , a student at McGill University in Montreal . The program downloaded the directory listings of all the files located on public anonymous FTP ( File Transfer Protocol ) sites, creating a searchable database of file names; however, Archie did not index the contents of these sites.
  • 5. The rise of Gopher (created in 1991 by Mark McCahill at the University of Minnesota ) led to two new search programs, Veronica and Jughead . Like Archie, they searched the file names and titles stored in Gopher index systems. Veronica ( V ery E asy R odent- O riented N et-wide I ndex to C omputerized A rchives) provided a keyword search of most Gopher menu titles in the entire Gopher listings. Jughead ( J onzy's U niversal G opher H ierarchy E xcavation A nd D isplay) was a tool for obtaining menu information from specific Gopher servers. While the name of the search engine "Archie" was not a reference to the Archie comic book series, " Veronica " and " Jughead " are characters in the series, thus referencing their predecessor.
  • 6. In the summer of 1993, no search engine existed yet for the web, though numerous specialized catalogues were maintained by hand. Oscar Nierstrasz at the University of Geneva wrote a series of Perl scripts that would periodically mirror these pages and rewrite them into a standard format which formed the basis for W3Catalog , the web's first primitive search engine, released on September 2, 1993 [4] .
  • 7. In June 1993, Matthew Gray, then at MIT , produced what was probably the first web robot , the Perl -based World Wide Web Wanderer , and used it to generate an index called 'Wandex'. The purpose of the Wanderer was to measure the size of the World Wide Web, which it did until late 1995. The web's second search engine Aliweb appeared in November 1993. Aliweb did not use a web robot , but instead depended on being notified by website administrators of the existence at each site of an index file in a particular format.
  • 8. JumpStation (released in December 1993 [5] ) used a web robot to find web pages and to build its index, and used a web form as the interface to its query program. It was thus the first WWW resource-discovery tool to combine the three essential features of a web search engine (crawling, indexing, and searching) as described below. Because of the limited resources available on the platform on which it ran, its indexing and hence searching were limited to the titles and headings found in the web pages the crawler encountered.
  • 9. One of the first "full text" crawler-based search engines was WebCrawler , which came out in 1994. Unlike its predecessors, it let users search for any word in any webpage, which has become the standard for all major search engines since. It was also the first one to be widely known by the public. Also in 1994, Lycos (which started at Carnegie Mellon University ) was launched and became a major commercial endeavor.
  • 10. Soon after, many search engines appeared and vied for popularity. These included Magellan , Excite , Infoseek , Inktomi , Northern Light , and AltaVista . Yahoo! was among the most popular ways for people to find web pages of interest, but its search function operated on its web directory , rather than full-text copies of web pages. Information seekers could also browse the directory instead of doing a keyword-based search.