Please use this identifier to cite or link to this item:
http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/1894
Title: | WiTPy: a toolkit to parse and analyse Wikipedia talk pages |
Authors: | Verma, A. A.; Iyengar, S. R. S.; Gandhi, N. |
Keywords: | Wikipedia; open source; text mining; NLP |
Issue Date: | 22-Jun-2021 |
Abstract: | In this article, we propose an open-source toolkit to extract, parse, and analyze Wikipedia talk pages. The core parser uses a tree-based approach to parse the unstructured comments and a JSON (JavaScript Object Notation) structure to store them in a NoSQL (not only SQL) database. User-friendly, high-level analysis methods are built on top of the NoSQL database and can be used to understand the collaboration dynamics on article talk pages. CCS Concepts: • Information systems → Specialized information retrieval; • Human-centered computing → Wikis. |
URI: | http://localhost:8080/xmlui/handle/123456789/1894 |
Appears in Collections: | Year-2020 |
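
The tree-based parsing and JSON storage mentioned in the abstract can be illustrated with a minimal sketch. It is not WiTPy's actual parser, whose code is not reproduced in this record. It assumes only the common talk-page convention that each leading colon marks one level of reply indentation; the function name `parse_talk_section`, the field names, and the sample text are hypothetical.

```python
import json


def parse_talk_section(wikitext):
    """Build a reply tree from colon-indented talk page comments.

    Assumption: one leading ':' equals one level of reply depth.
    """
    root = {"depth": -1, "text": None, "replies": []}
    stack = [root]  # path from the root to the most recent comment
    for line in wikitext.splitlines():
        if not line.strip():
            continue  # skip blank lines
        stripped = line.lstrip(":")
        depth = len(line) - len(stripped)
        node = {"depth": depth, "text": stripped.strip(), "replies": []}
        # Climb back up until the top of the stack is a shallower comment.
        while stack[-1]["depth"] >= depth:
            stack.pop()
        stack[-1]["replies"].append(node)
        stack.append(node)
    return root["replies"]


# Hypothetical sample discussion.
sample = (
    "I think the lead needs a source.\n"
    ":Agreed, I will add one.\n"
    "::Thanks!\n"
    ":I disagree.\n"
    "Separate point."
)

threads = parse_talk_section(sample)
# A nested JSON document of this shape could be stored in a document database.
document = json.dumps({"page": "Talk:Example", "threads": threads}, indent=2)
print(document)
```

Each top-level comment becomes one thread and replies nest under their parent, so the resulting JSON mirrors the discussion structure and can be queried by depth or by thread.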
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Fulltext.pdf | | 2.21 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.