Please use this identifier to cite or link to this item:
http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/3370
Title: | MDCADNet: multi dilated & context aggregated dense network for non-textual components classification in digital documents |
Authors: | Singh, M. Goyal, P. |
Keywords: | Chart classification Chart understanding DenseNet Document intelligence Multi dilation |
Issue Date: | 23-Apr-2022 |
Abstract: | Non-Textual images like charts and tables are unlike natural images in various aspects, including high inter-class similarities, low intra-class similarities, substantial textual component proportions, and lower resolutions. This paper proposes a novel Multi-Dilated Context Aggregation based Dense Network (MDCADNet) addressing the multi-resolution and larger receptive field modeling need for the non-textual component classification task. MDCADNet includes a densely connected convolutional network for the feature map computation as front-end with a multi-dilated Backend Context Module (BCM). The proposed BCM generates multi-scale features and provides a systematic context aggregation of both low and high-level feature maps through its densely connected layers. Additionally, the controlled multi-dilation scheme offers a more extensive scale range for better prediction performance. A thorough quantitative evaluation has been performed on seven benchmark datasets for demonstrating the generalization capability of MDCADNet. Experimental results show MDCADNet performs consistently better than the state-of-the-art models across all datasets. |
URI: | http://localhost:8080/xmlui/handle/123456789/3370 |
Appears in Collections: | Year-2022 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Full Text.pdf | 3.69 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.