INSTITUTIONAL DIGITAL REPOSITORY

WiTPy: a toolkit to parse and analyse wikipedia talk pages

Show simple item record

dc.contributor.author Verma, A. A.
dc.contributor.author Iyengar, S.R.S.
dc.contributor.author Gandhi, N.
dc.date.accessioned 2021-06-21T22:35:04Z
dc.date.available 2021-06-21T22:35:04Z
dc.date.issued 2021-06-22
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/1894
dc.description.abstract In this article, we propose an opensource toolkit to extract, parse, and analyze the Wikipedia talk pages. The core parser uses a tree-based approach to parse the unstructured comments and a JSON(JavaScript Object Notation) structure to store them in a NoSQL(not only SQL) database. User-friendly and high-level analysis methods are created on the top of NoSQL database, which can be used to understand the collaboration dynamics on article talk pages. CCS CONCEPTS • Information systems → Specialized information retrieval; • Human-centered computing → Wikis. en_US
dc.language.iso en_US en_US
dc.subject Wikipedia en_US
dc.subject opensource en_US
dc.subject text-mining en_US
dc.subject NLP en_US
dc.title WiTPy: a toolkit to parse and analyse wikipedia talk pages en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account