Method and tool support for classifying software languages with Wikipedia

Status
To appear in the proceedings of SLE 2013

Authors
Ralf Lämmel, Dominik Mosen, Andrei Varanovich

Abstract
Wikipedia provides useful input for efforts on mining taxonomies or ontologies in specific domains. In particular, Wikipedia's categories serve classification. In this paper, we describe a method and a corresponding tool, WikiTax, for exploring Wikipedia's category graph with the objective of supporting the development of a classification of software languages. The category graph is extracted level by level. The extracted graph is visualized in a tree-like manner. Category attributes (i.e., metrics) such as depth are visualized. Irrelevant edges and nodes may be excluded. These exclusions are documented while using a manageable and well-defined set of 'exclusion types' as comments.
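
The level-by-level extraction with documented exclusions can be illustrated with a minimal sketch. This is not WikiTax's implementation: the category names, the exclusion type labels, and the functions extract_levels and render are hypothetical, and a small in-memory dictionary stands in for Wikipedia's category graph.

```python
from collections import deque

# Toy stand-in for Wikipedia's category graph (hypothetical data).
SUBCATEGORIES = {
    "Computer languages": ["Programming languages", "Markup languages", "Language lists"],
    "Programming languages": ["Functional languages", "Programming language topics"],
    "Markup languages": ["XML-based standards"],
    "Functional languages": [],
    "Programming language topics": [],
    "XML-based standards": [],
    "Language lists": [],
}

# Excluded nodes, each documented with an assumed exclusion type label.
EXCLUDED = {
    "Language lists": "administrative",
    "Programming language topics": "off-topic",
}


def extract_levels(root, max_depth):
    """Breadth-first (level-by-level) extraction down to max_depth.

    Returns the depth of every kept category and the kept tree edges.
    """
    depth = {root: 0}
    children = {root: []}
    queue = deque([root])
    while queue:
        cat = queue.popleft()
        if depth[cat] == max_depth:
            continue  # do not descend below the requested level
        for sub in SUBCATEGORIES.get(cat, []):
            if sub in EXCLUDED or sub in depth:
                continue  # skip excluded or already visited categories
            depth[sub] = depth[cat] + 1
            children[sub] = []
            children[cat].append(sub)
            queue.append(sub)
    return depth, children


def render(cat, depth, children):
    """Return indented lines showing the extracted tree with the depth metric."""
    lines = ["  " * depth[cat] + f"{cat} (depth {depth[cat]})"]
    for sub in children[cat]:
        lines.extend(render(sub, depth, children))
    return lines


depth, children = extract_levels("Computer languages", max_depth=2)
print("\n".join(render("Computer languages", depth, children)))
```

Keeping the exclusions in a separate mapping from node to exclusion type means that every pruning decision stays visible and can be reviewed independently of the extraction itself.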

Bibtex entry
@inproceedings{wikitax,
  author    = {Ralf L{\"a}mmel and Dominik Mosen and Andrei Varanovich},
  title     = "Method and tool support for classifying software languages with Wikipedia",
  booktitle = "Proc.\ of SLE 2013",
  year      = {2013},
  note      = "10 pages. To appear."
}

Downloads and links