Please use this identifier to cite or link to this item: http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/1894
Title: WiTPy: a toolkit to parse and analyse wikipedia talk pages
Authors: Verma, A. A.
Iyengar, S.R.S.
Gandhi, N.
Keywords: Wikipedia
opensource
text-mining
NLP
Issue Date: 22-Jun-2021
Abstract: In this article, we propose an opensource toolkit to extract, parse, and analyze the Wikipedia talk pages. The core parser uses a tree-based approach to parse the unstructured comments and a JSON(JavaScript Object Notation) structure to store them in a NoSQL(not only SQL) database. User-friendly and high-level analysis methods are created on the top of NoSQL database, which can be used to understand the collaboration dynamics on article talk pages. CCS CONCEPTS • Information systems → Specialized information retrieval; • Human-centered computing → Wikis.
URI: http://localhost:8080/xmlui/handle/123456789/1894
Appears in Collections:Year-2020

Files in This Item:
File Description SizeFormat 
Fulltext.pdf2.21 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.