Journey of Web Search Engines: Milestones, Challenges & Innovations

Full Text (PDF, 588KB), PP.47-58

Author(s)

Mamta Kathuria 1,*, C. K. Nagpal 1, Neelam Duhan 1

1. YMCA University of Science & Technology, Faridabad, 121001, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2016.12.06

Received: 4 Mar. 2016 / Revised: 28 Jun. 2016 / Accepted: 1 Sep. 2016 / Published: 8 Dec. 2016

Index Terms

World Wide Web, Search Engines, Web Search, Information Retrieval

Abstract

The past few decades have witnessed an information big bang in the form of the World Wide Web, leading to a gigantic repository of heterogeneous data. A humble journey that started with a network connection between a few computers in the ARPANET project has reached a level at which almost all the computers and other communication devices of the world have joined together to form a huge global information network that makes available most of the information related to every possible heterogeneous domain. Managing and indexing this repository is a major concern, and providing a quick answer to the user's query is equally critical. Amazingly, almost miraculously, current web search engines perform this task quite efficiently. This miracle has been possible due to a series of mathematical and technological innovations continuously being carried out in the area of search techniques. This paper presents an overview of search engine evolution from the primitive to the present.

Cite This Paper

Mamta Kathuria, C. K. Nagpal, Neelam Duhan, "Journey of Web Search Engines: Milestones, Challenges & Innovations", International Journal of Information Technology and Computer Science (IJITCS), Vol. 8, No. 12, pp. 47-58, 2016. DOI: 10.5815/ijitcs.2016.12.06
