![]()
Which article is the closest (in number of links) to all the others? Find shortest paths between wikipedia articles.
150GB of XML (after it's uncompressed) is far too much data to run any sort of analysis over. It also contains a pile of useless information, for example I didn't care about the content of the articles, only which other articles…










