Mining Wikipedia to Rank Rock Guitarists

Full Text (PDF, 450 KB), pp. 50-56


Author(s)

Muazzam A. Siddiqui 1,*

1. Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Saudi Arabia

* Corresponding author.

DOI: https://doi.org/10.5815/ijisa.2015.12.05

Received: 5 Mar. 2015 / Revised: 28 Jul. 2015 / Accepted: 5 Sep. 2015 / Published: 8 Nov. 2015

Index Terms

Wikipedia mining, PageRank for people, information extraction, text mining, music data mining

Abstract

We present a method for identifying the most influential rock guitarists by applying Google's PageRank algorithm to information extracted from Wikipedia articles. A guitarist's influence is estimated from the number of guitarists who cite him or her as an influence, weighted by the influence of those citing guitarists. We extracted this who-influenced-whom data from Wikipedia biographies and converted it to a directed graph in which each node represents a guitarist and each edge between two nodes indicates the influence of one guitarist on the other. We then ranked the guitarists with the PageRank algorithm. The results provide a quantitative foundation for the idea that most contemporary rock guitarists are influenced by early blues guitarists. Although no direct comparison exists, the list was validated against a number of other best-of lists available online and found to be mostly compatible.
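The ranking step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the edge direction (from a guitarist to the guitarist he or she cites as an influence), the damping factor of 0.85, the handling of nodes without outgoing edges, and the placeholder names `player_a` to `player_d` are all assumptions made for this example, and the toy data is hypothetical rather than extracted from Wikipedia.

```python
def pagerank(edges, damping=0.85, tol=1e-10, max_iter=200):
    """Power-iteration PageRank on a directed graph given as (source, target) pairs."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Rank held by nodes with no outgoing edges is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n for node in nodes
        }
        # Each node passes an equal share of its rank to every node it points to.
        for src in nodes:
            if out_links[src]:
                share = damping * rank[src] / len(out_links[src])
                for dst in out_links[src]:
                    new_rank[dst] += share
        delta = sum(abs(new_rank[node] - rank[node]) for node in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical toy data: (guitarist, cited influence). Not taken from the paper.
influences = [
    ("player_b", "player_a"),
    ("player_c", "player_a"),
    ("player_d", "player_b"),
    ("player_d", "player_c"),
]

scores = pagerank(influences)
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

In this toy graph `player_a` is cited by two guitarists who are themselves cited, so it accumulates the most rank, while `player_d`, whom nobody cites, ends up last. This mirrors the abstract's idea that influence depends both on how many guitarists cite someone and on how influential those guitarists are.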

Cite This Paper

Muazzam A. Siddiqui, "Mining Wikipedia to Rank Rock Guitarists", International Journal of Intelligent Systems and Applications (IJISA), vol. 7, no. 12, pp. 50-56, 2015. DOI: 10.5815/ijisa.2015.12.05
