Tag: JSON / JAVA / XML

Community

Parsing Wikipedia XML dump

(Ilia Reznik and Vladimir Shatalov) Wikipedia is an ideal subject for data mining, and much research focuses on techniques for retrieving information of interest from it. For online extraction, Rion Williams used a powerful library designed by Petr Onderka in his project. Wikipedia itself p