Please use this identifier to cite or link to this item:
http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/1894
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Verma, A. A. | - |
dc.contributor.author | Iyengar, S.R.S. | - |
dc.contributor.author | Gandhi, N. | - |
dc.date.accessioned | 2021-06-21T22:35:04Z | - |
dc.date.available | 2021-06-21T22:35:04Z | - |
dc.date.issued | 2021-06-22 | - |
dc.identifier.uri | http://localhost:8080/xmlui/handle/123456789/1894 | - |
dc.description.abstract | In this article, we propose an open-source toolkit to extract, parse, and analyze Wikipedia talk pages. The core parser uses a tree-based approach to parse the unstructured comments and a JSON (JavaScript Object Notation) structure to store them in a NoSQL (not only SQL) database. User-friendly, high-level analysis methods are built on top of the NoSQL database and can be used to understand the collaboration dynamics on article talk pages. CCS Concepts: • Information systems → Specialized information retrieval; • Human-centered computing → Wikis. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Wikipedia | en_US |
dc.subject | opensource | en_US |
dc.subject | text-mining | en_US |
dc.subject | NLP | en_US |
dc.title | WiTPy: a toolkit to parse and analyse Wikipedia talk pages | en_US |
dc.type | Article | en_US |
Appears in Collections: | Year-2020 |
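
The abstract describes a tree-based parser that turns unstructured talk-page comments into a JSON structure. The following is a minimal sketch of that general idea, not WiTPy's actual implementation or API: it assumes replies are indented with leading colons, as is conventional on Wikipedia talk pages, and the function name `parse_talk_thread` and the node fields `depth`, `text`, and `replies` are hypothetical names chosen for illustration.

```python
import json


def parse_talk_thread(wikitext):
    """Build a reply tree from colon-indented talk-page lines.

    Assumption: each non-empty line is one comment, and the number of
    leading ':' characters gives its nesting depth.
    """
    # Sentinel root sits below every real depth, so it is never popped.
    root = {"depth": -1, "text": None, "replies": []}
    stack = [root]
    for line in wikitext.splitlines():
        if not line.strip():
            continue  # skip blank lines
        stripped = line.lstrip(":")
        depth = len(line) - len(stripped)
        node = {"depth": depth, "text": stripped.strip(), "replies": []}
        # Pop back to the nearest comment that is shallower than this one.
        while stack[-1]["depth"] >= depth:
            stack.pop()
        stack[-1]["replies"].append(node)
        stack.append(node)
    return root["replies"]


sample = (
    "First comment\n"
    ":Reply to first\n"
    "::Nested reply\n"
    ":Second reply\n"
    "Another top-level comment"
)
thread = parse_talk_thread(sample)
# The nested dictionaries serialize directly to JSON.
print(json.dumps(thread, indent=2))
```

Because each thread becomes a single nested JSON object, it could in principle be stored as one document in a document-oriented NoSQL database; which database and schema the toolkit uses is not stated in this record.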
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Fulltext.pdf | | 2.21 MB | Adobe PDF | |