Work place: Department of Computer Science, University of Ilorin, P.M.B 1515, Ilorin, Nigeria
E-mail: mmabayoje@yahoo.com
Website:
Research Interests: Software Construction, Software Development Process, Software Engineering, Artificial Intelligence
Biography
M. A. Mabayoje is a lecturer in the department of Computer Science, University of Ilorin, Nigeria. She bagged Bachelor of Science and Master of Science in Computer Science in the University of Ilorin, Nigeria. She is currently a PhD student in the same university. She is a member of Nigeria Computer, Society, Science Association of Nigeria among others. Her research interests include ontology, Artificial Intelligence, Software Engineering. She is married with children.
By Modinat. A. Mabayoje O. S. Oni Olawale S. Adebayo
DOI: https://doi.org/10.5815/ijcnis.2013.03.01, Pub. Date: 8 Mar. 2013
With the amount of available text data on the web growing rapidly, the need for users to search such information is dramatically increasing. Full text search engines and relational databases each have unique strengths as development tools but also have overlapping capabilities. Both can provide for storage and update of data and both support search of the data. Full text systems are better for quickly searching high volumes of unstructured text for the presence of any word or combination of words. They provide rich text search capabilities and sophisticated relevancy ranking tools for ordering results based on how well they match a potentially fuzzy search request. Relational databases, on the other hand, excel at storing and manipulating structured data -- records of fields of specific types (text, integer, currency, etc.). They can do so with little or no redundancy. They support flexible search of multiple record types for specific values of fields, as well strong tools for quickly and securely updating individual records. The web being a collection of largely unstructured document which is ever growing in size, the appeal of using RDBMS for searching this collection of documents has become very costly.
This paper describes the architecture, design and implementation of a prototype website search engine powered by Lucene to search through any website. This approach involves the development of a small scale web crawler to gather information from the desired website. The gathered information are then converted to a Lucene document and stored in the index. The time taken to search the index is very short when compared with how long it takes for a relational database to process a query.
Subscribe to receive issue release notifications and newsletters from MECS Press journals