Wikipedia:Wikipedia in academic studies 

Below is an incomplete list of academic conference presentations, peer-reviewed papers and other types of academic writing which focus on Wikipedia as their subject. Works that mention Wikipedia only in passing are unlikely to be listed.

Unpublished works of presumably academic quality are listed in a dedicated section. For non-academic research, as well as tools that may be useful in researching Wikipedia, see Wikipedia:Researching Wikipedia. For a WikiProject focussed on doing research on Wikipedia, see Wikipedia:WikiProject Wikidemia.

For academic papers using Wikipedia as a source, see Wikipedia:Wikipedia as an academic source, and the bibliography links listed at the bottom of this page. For teaching with Wikipedia, see Wikipedia:School and university projects. For researching with Wikipedia, see Wikipedia:Researching with Wikipedia. For non-academic works focused on Wikipedia, see Wikipedia:Wikipedia in the media.

Over time

Growth of academic interest in Wikipedia: number of publications by year, from the creation of Wikipedia to the end of 2008. Source: based on the mid-May 2008 revision of this page.

Conference presentations and papers

See also: Wikimania conference series
Authors | Title | Conference / published in | Year | Online | Notes | Abstract | Keywords


Johannes Schöning, Brent Hecht, Martin Raubal, Antonio Krüger, Meri Marsh, and Michael Rohs Improving Interaction with Virtual Globes through Spatial Thinking: Helping Users Ask "Why?" Intelligent User Interfaces (IUI) 2008 [1] virtual globes, spatial thinking, multi-touch interaction, wall-size interfaces, artificial intelligence, wikipedia, semantic relatedness
Brent Hecht and Johannes Schöning Mapping the Zeitgeist Fifth International Conference on Geographic Information Science (GIScience) 2008 [2] zeitgeist, semantic relatedness, spatialization, spatial wikipedia
Brent Hecht and Martin Raubal Geographically explore semantic relations in world knowledge 11th AGILE International Conference on Geographic Information Science 2008 [3] semantic relatedness, network analysis, non-classical relations, geography, wikipedia
Darren Hardy Discovering behavioral patterns in collective authorship of place-based information Internet Research 9.0: Rethinking Community, Rethinking Place (to appear) 2008 [4] geotagging, peer production, Wikipedia, bots
Andrew Krizhanovsky Index wiki database: design and experiments FLINS'08, Corpus Linguistics'08, AIS/CAD'08 2008 [5] Synarcher corpus linguistics, inverted index, Zipf's law, information retrieval
Torsten Zesch, Christof Müller and Iryna Gurevych Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary LREC'08 2008 [6]
Michael Roth and Sabine Schulte im Walde Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information LREC'08 2008 [7]
Laura Kassner, Vivi Nastase and Michael Strube Acquiring a Taxonomy from the German Wikipedia LREC'08 2008 [8]
Jordi Atserias, Hugo Zaragoza, Massimiliano Ciaramita and Giuseppe Attardi Semantically Annotated Snapshot of the English Wikipedia LREC'08 2008 [9]
Adrian Iftene and Alexandra Balahur-Dobrescu Named Entity Relation Mining using Wikipedia LREC'08 2008 [10]
Gaoying Cui, Qin Lu, Wenjie Li and Yirong Chen Corpus Exploitation from Wikipedia for Ontology Construction LREC'08 2008 [11]
Alexander E. Richman, Patrick Schone Mining Wiki Resources for Multilingual Named Entity Recognition ACL-08: HLT, pp. 1–9 2008 [12]
Michael Kaisser The QuALiM Question Answering Demo: Supplementing Answers with Paragraphs drawn from Wikipedia ACL-08: HLT Demo Session, pp. 32–35 2008 [13]
Elif Yamangil, Rani Nelken Mining Wikipedia Revision Histories for Improving Sentence Compression ACL-08: HLT, Short Papers, pp. 137–140 2008 [14]
Fadi Biadsy, Julia Hirschberg, Elena Filatova An Unsupervised Approach to Biography Production using Wikipedia ACL-08: HLT, pp. 807–815 2008 [15]
Kai Wang, Chien-Liang Lin, Chun-Der Chen, and Shu-Chen Yang The adoption of Wikipedia: a community- and information quality-based view 12th Pacific Asia Conference on Information Systems (PACIS) 2008 [16] Wikipedia-Lab work TAM, Wikipedia, Critical Mass, Community identification, Information quality
Carlo A. Curino, Hyun J. Moon, Letizia Tanca, Carlo Zaniolo Schema Evolution in Wikipedia: toward a Web Information System Benchmark International Conference on Enterprise Information Systems (ICEIS), 2008 [17] Panta Rhei Project Schema Evolution, Benchmark, Schema Versioning, Query Rewriting


Carlo A. Curino, Hyun J. Moon, Carlo Zaniolo Graceful Database Schema Evolution: the PRISM Workbench Very Large DataBases (VLDB), 2008 Panta Rhei Project Schema Evolution, Graceful Evolution, Schema Versioning, Query Rewriting
Hyun J. Moon, Carlo A. Curino, Alin Deutsch, Chien-Yi Hou, Carlo Zaniolo Managing and Querying Transaction-time Databases under Schema Evolution Very Large DataBases (VLDB), 2008 Panta Rhei Project Schema Evolution, Transaction Time DB, Query Rewriting
Fogarolli Angela and Ronchetti Marco Intelligent Mining and Indexing of Multi-Language e-Learning Material Proc. of 1st International Symposium on Intelligent Interactive Multimedia Systems and Services, KES IIMS 2008, 9-11 July 2008 Piraeus, Greece Studies in Computational Intelligence, Springer-Verlag (2008). Note: to appear. 2008 Content Retrieval, Content Filtering, Search over semi-structural Web sources, Multimedia, e-Learning


Fogarolli Angela and Ronchetti Marco Discovering Semantics in Multimedia Content using Wikipedia Proc. of 11th BIS 2008, 5-7 May 2008 Innsbruck, Austria. Lecture Notes in Business Information Processing, pp. 48–57. Springer, Heidelberg (2008) 2008 Content Retrieval, Content Filtering, Search over semi-structural Web sources, Multimedia, e-Learning
Tyers, F. and Pienaar, J. Extracting bilingual word pairs from Wikipedia SALTMIL workshop at Language Resources and Evaluation Conference (LREC) 2008, (To appear) 2008 Under-resourced languages, Machine translation, Language resources, Bilingual terminology, Interwiki links
Fei Wu, Daniel S. Weld Automatically Refining the Wikipedia Infobox Ontology 17th International World Wide Web Conference (www-08) 2008 [18] The Intelligence in Wikipedia Project at University of Washington Semantic Web, Ontology, Wikipedia, Markov Logic Networks
Maike Erdmann, Kotaro Nakayama, Takahiro Hara, Sojiro Nishio An Approach for Extracting Bilingual Terminology from Wikipedia 13th International Conference on Database Systems for Advanced Applications (DASFAA, To appear) 2008 [19] Wikipedia-Lab work Wikipedia Mining, Bilingual Terminology, Link Structure Analysis
Kotaro Nakayama, Takahiro Hara, Sojiro Nishio A Search Engine for Browsing the Wikipedia Thesaurus 13th International Conference on Database Systems for Advanced Applications, Demo session (DASFAA, To appear) 2008 [20] Wikipedia-Lab work Wikipedia Mining, Association Thesaurus, Link Structure Analysis, XML Web Services
Kotaro Nakayama, Masahiro Ito, Takahiro Hara, Sojiro Nishio Wikipedia Mining for Huge Scale Japanese Association Thesaurus Construction International Symposium on Mining And Web (IEEE MAW, To appear) conjunction with IEEE AINA 2008 [21] Wikipedia-Lab work Wikipedia Mining, Association Thesaurus, Link Structure Analysis


Minghua Pei, Kotaro Nakayama, Takahiro Hara, Sojiro Nishio Constructing a Global Ontology by Concept Mapping using Wikipedia Thesaurus International Symposium on Mining And Web (IEEE MAW, To appear) conjunction with IEEE AINA 2008 [22] Wikipedia-Lab work Wikipedia Mining, Association Thesaurus, Ontology Mapping, Global Ontology
Joachim Schroer, Guido Hertel Voluntary engagement in an open web-based encyclopedia: From reading to contributing 10th International General Online Research Conference, Hamburg, Germany 2008 [23] wikipedia, contributors, motivation, instrumentality, intrinsic motivation
Martin Potthast, Benno Stein, Maik Anderka A Wikipedia-Based Multilingual Retrieval Model 30th European Conference on IR Research, ECIR 2008, Glasgow 2008 [24] multilingual retrieval model, explicit semantic analysis, wikipedia
Martin Potthast, Benno Stein, Robert Gerling Automatic Vandalism Detection in Wikipedia 30th European Conference on IR Research, ECIR 2008, Glasgow 2008 [25] vandalism, machine learning, wikipedia
Ivan Beschastnikh, Travis Kriplean, David W. McDonald Wikipedian Self-Governance in Action: Motivating the Policy Lens Proceedings of the Second International Conference on Weblogs and Social Media, AAAI, March 31, 2008 2008 [26] policy use, governance, wikipedia
Andrea Forte, Amy Bruckman Scaling Consensus: Increasing Decentralization in Wikipedia Governance HICSS 2008, pp. 157-157. 2008 [27] governance, wikipedia
Zareen Syed, Tim Finin, and Anupam Joshi Wikipedia as an Ontology for Describing Documents Proceedings of the Second International Conference on Weblogs and Social Media, AAAI, March 31, 2008 2008 [28] ontology, wikipedia, information retrieval, text classification
Felipe Ortega, Jesus M. Gonzalez-Barahona and Gregorio Robles On the Inequality of Contributions to Wikipedia HICSS 2008 2008 [29] Application of the Gini coefficient to measure the level of inequality of the contributions to the top ten language editions of Wikipedia. wikipedia
Anne-Marie Vercoustre, James A. Thom and Jovan Pehcevski Entity Ranking in Wikipedia SAC’08 March 16-20, 2008, Fortaleza, Ceara, Brazil 2008 [30] Entity Ranking, XML Retrieval, Test collection


Brent Hecht, Michael Rohs, Johannes Schöning and Antonio Krüger WikEye - Using Magic Lenses to Explore Georeferenced Wikipedia Content. 3rd International Workshop on Pervasive Mobile Interaction Devices (PERMID) in Conjunction with Pervasive Computing 2007 [31] wikipedia data-mining, magic lens, augmented reality, markerless tracking


Marek Meyer, Christoph Rensing, Ralf Steinmetz Categorizing Learning Objects Based On Wikipedia as Substitute Corpus First International Workshop on Learning Object Discovery & Exchange (LODE'07), September 18, 2007, Crete, Greece 2007 [32] Usage of Wikipedia as corpus for machine learning methods. Wikipedia, Categorization, Metadata, kNN, Classification, Substitute Corpus, Automatic Metadata Generation
Overell, Simon E., and Stefan Rüger Geographic co-occurrence as a tool for GIR. 4th ACM workshop on Geographical Information Retrieval. Lisbon, Portugal. 2007 [33] Wikipedia, disambiguation, geographic information retrieval
Torsten Zesch, Iryna Gurevych Analysis of the Wikipedia Category Graph for NLP Applications. Proceedings of the TextGraphs-2 Workshop (NAACL-HLT) 2007 [34] nlp, relatedness, semantic, wikipedia
Antonio Toral and Rafael Muñoz Towards a Named Entity Wordnet (NEWN) Proceedings of the 6th International Conference on Recent Advances in Natural Language Processing (RANLP). Borovets (Bulgaria). pp. 604-608. September 2007 2007 [35] poster?
Ulrik Brandes and Jürgen Lerner Visual Analysis of Controversy in User-generated Encyclopedias Proc. IEEE Symp. Visual Analytics Science and Technology (VAST '07), to appear. 2007 [36] social network controversy editing visualisation wikipedia
V Jijkoun, M de Rijke WiQA: Evaluating Multi-lingual Focused Access to Wikipedia Proceedings EVIA, 2007 2007 [37]
Martin Potthast Wikipedia in the pocket: indexing technology for near-duplicate detection and high similarity search SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval 2007 [38] wikipedia
Minier, Zsolt; Bodo, Zalan; Csato, Lehel Wikipedia-Based Kernels for Text Categorization Symbolic and Numeric Algorithms for Scientific Computing, 2007. SYNASC. International Symposium on 2007 [39]
Thomas, Christopher; Sheth, Amit P. Semantic Convergence of Wikipedia Articles Web Intelligence, IEEE/WIC/ACM International Conference on 2007 [40]
Rada Mihalcea Using Wikipedia for Automatic Word Sense Disambiguation Proceedings of NAACL HLT, 2007 2007 [41]
J Yu, JA Thom, A Tam Ontology evaluation using wikipedia categories for browsing Proceedings of the sixteenth ACM Conference on Information and Knowledge Management 2007 [42] browsing, ontology evaluation, user studies, wikipedia
Martin Wattenberg, Fernanda B. Viégas and Katherine Hollenbach Visualizing Activity on Wikipedia with Chromograms Human-Computer Interaction – INTERACT 2007 2007 [43] Wikipedia - Visualization - Peer Production - Visualization
A Kittur, E Chi, BA Pendleton, B Suh, T Mytkowicz Power of the Few vs. Wisdom of the Crowd: Wikipedia and the Rise of the Bourgeoisie 25th Annual ACM Conference on Human Factors in Computing Systems (CHI 2007); 2007 April 28 - May 3; San Jose; CA. 2007 [44] Wikipedia, Wiki, collaboration, collaborative knowledge systems, social tagging, delicious.
Meiqun Hu, Ee-Peng Lim, Aixin Sun, Hady W Lauw, Ba-Quy Vuong On improving wikipedia search using article quality WIDM '07: Proceedings of the 9th annual ACM international workshop on Web information and data management 2007 [45] quality, wikipedia
Wilkinson, Dennis M. and Huberman, Bernardo A. Cooperation and quality in wikipedia WikiSym '07: Proceedings of the 2007 international symposium on Wikis. 2007 [46] Wikipedia, collaborative authoring, cooperation, groupware
DPT Nguyen, Y Matsuo, M Ishizuka Subtree Mining for Relation Extraction from Wikipedia Proc. of NAACL/HLT 2007 2007 [47]
Bongwon Suh, Ed H Chi, Bryan A Pendleton, Aniket Kittur Us vs. Them: Understanding Social Dynamics in Wikipedia with Revert Graph Visualizations Visual Analytics Science and Technology, 2007. VAST 2007. IEEE Symposium on (2007), pp. 163-170. 2007 [48] motivation, social-network, wikipedia
Kittur, Aniket and Suh, Bongwon and Pendleton, Bryan A. and Chi, Ed H. He says, she says: conflict and coordination in Wikipedia CHI '07: Proceedings of the SIGCHI conference on Human factors in computing systems 2007 [49] Wiki, Wikipedia, collaboration, conflict, user model, visualization, web-based interaction
Davide Buscaldi and Paolo Rosso A Comparison of Methods for the Automatic Identification of Locations in Wikipedia Proceedings of GIR’07 2007 [50] Algorithms, Measurement, Performance, text analysis, language models
Li, Yinghao and Wing and Kei and Fu Improving weak ad-hoc queries using wikipedia as external corpus SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval 2007 [51] Wikipedia, external corpus, pseudo-relevance feedback
Y Watanabe, M Asahara, Y Matsumoto A Graph-based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) 2007 [52]
Simone Braun and Andreas Schmidt Wikis as a Technology Fostering Knowledge Maturing: What we can learn from Wikipedia 7th International Conference on Knowledge Management (IKNOW '07), Special Track on Integrating Working and Learning in Business (IWL), 2007. 2007 [53] knowledge management wiki wikipedia
Linyun Fu and Haofen Wang and Haiping Zhu and Huajie Zhang and Yang Wang and Yong Yu Making More Wikipedians: Facilitating Semantics Reuse for Wikipedia Authoring Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea, 4825: 127--140, 2007. 2007 [54] semanticWeb web2.0 wikipedia
Sören Auer and Chris Bizer and Jens Lehmann and Georgi Kobilarov and Richard Cyganiak and Zachary Ives DBpedia: A Nucleus for a Web of Open Data Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea, 4825: 715--728, 2007. 2007 [55] information retrieval mashup semantic Web wikipedia
Simone P. Ponzetto and Michael Strube An API for Measuring the Relatedness of Words in Wikipedia Companion Volume to the Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, : 23--30, 2007. 2007 [56] api, relatedness, semantic_web, semantic, wikipedia
Ponzetto, Simone P. and Strube, Michael Deriving a Large Scale Taxonomy from Wikipedia Proceedings of the 22nd National Conference on Artificial Intelligence, Vancouver, B.C., 22-26 July 2007 [57] api, relatedness, semantic web, semantic, wikipedia
Simone Paolo Ponzetto Creating a Knowledge Base from a Collaboratively Generated Encyclopedia Proceedings of the NAACL-HLT 2007 Doctoral Consortium, pp 9-12, Rochester, NY, April 2007 2007 [58]
Ralf Schenkel, Fabian Suchanek and Gjergji Kasneci YAWN: A Semantically Annotated Wikipedia XML Corpus BTW2007 2007 [59]
Hugo Zaragoza, Henning Rode, Peter Mika, Jordi Atserias, Massimiliano Ciaramita & Giuseppe Attardi Ranking Very Many Typed Entities on Wikipedia CIKM ‘07: Proceedings of the Sixteenth ACM International Conference on Information and Knowledge Management 2007 [60]
Sören Auer and Jens Lehmann What Have Innsbruck and Leipzig in Common? Extracting Semantics from Wiki Content Proceedings of 4th European Semantic Web Conference; published in The Semantic Web: Research and Applications, pages 503-517 2007 [61]
George Bragues Wiki-Philosophizing in a Marketplace of Ideas: Evaluating Wikipedia's Entries on Seven Great Minds Social Science Research Network Working Paper Series (April 2007) 2007 [62] quality, wikipedia
Gang Wang and Yong Yu and Haiping Zhu PORE: Positive-Only Relation Extraction from Wikipedia Text Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea 2007 [63] annotation iswc, knowledge-extraction nlp semantic-web text-mining wikipedia
Fei Wu, Daniel S. Weld Autonomously semantifying wikipedia Proceedings of the sixteenth ACM Conference on Information and Knowledge Management 2007 [64] The Intelligence in Wikipedia Project at University of Washington Information Extraction, Wikipedia, Semantic Web
Viégas, Fernanda The Visual Side of Wikipedia System Sciences, 2007. HICSS 2007. 40th Annual Hawaii International Conference on 2007 [65]
Sean Hansen, Nicholas Berente, Kalle Lyytinen Wikipedia as Rational Discourse: An Illustration of the Emancipatory Potential of Information Systems Proceedings of Hawaiian International Conference of Systems Sciences, Big Island, Hawaii 2007 [66]
Fissaha Adafre, Sisay; Jijkoun, Valentin; de Rijke, Maarten Fact Discovery in Wikipedia Web Intelligence, IEEE/WIC/ACM International Conference on 2007 [67] nlp, relatedness, semantic, wikipedia
Li, Bing; Chen, Qing-Cai; Yeung, Daniel S.; Ng, Wing W.Y.; Wang, Xiao-Long Exploring Wikipedia and Query Log's Ability for Text Feature Representation Machine Learning and Cybernetics, 2007 International Conference on 2007 [68]
Wei Che Huang, Andrew Trotman, and Shlomo Geva Collaborative Knowledge Management: Evaluation of Automated Link Discovery in the Wikipedia SIGIR 2007 Workshop on Focused Retrieval, July 27, 2007, Amsterdam 2007 [69] Wikipedia, Link-the-Wiki, INEX, Evaluation, DTD, Best Entry Point
Morten Rask The Richness and Reach of Wikinomics: Is the Free Web-Based Encyclopedia Wikipedia Only for the Rich Countries? Proceedings of the Joint Conference of The International Society of Marketing Development and the Macromarketing Society, June 2-5, 2007 2007 [70]