Please use this identifier to cite or link to this item: http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/3635
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKapur, R.
dc.contributor.authorRao, P.U.
dc.contributor.authorDewam, A.
dc.contributor.authorSodhi, B.
dc.date.accessioned2022-07-15T10:34:18Z
dc.date.available2022-07-15T10:34:18Z
dc.date.issued2022-07-15
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/3635
dc.description.abstractSoftware development comprises the use of multiple Third-Party Libraries (TPLs). However, the irrelevant libraries present in software application's distributable often lead to excessive consumption of resources such as CPU cycles, memory, and modile-devices' battery usage. Therefore, the identification and removal of unused TPLs present in an application are desirable. We present a rapid, storage-efficient, obfuscation-resilient method to detect the irrelevant-TPLs in Java and Python applications. Our approach's novel aspects are i) Computing a vector representation of a .class file using a model that we call Lib2Vec. The Lib2Vec model is trained using the Paragraph Vector Algorithm. ii) Before using it for training the Lib2Vec models, a .class file is converted to a normalized form via semantics-preserving transformations. iii) A eXtra Library Detector (XtraLibD) developed and tested with 27 different language-specific Lib2Vec models. These models were trained using different parameters and >30,000 .class and >478,000 .py files taken from >100 different Java libraries and 43,711 Python available at MavenCentral.com and Pypi.com, respectively. XtraLibD achieves an accuracy of 99.48% with an F1 score of 0.968 and outperforms the existing tools, viz., LibScout, LiteRadar, and LibD with an accuracy improvement of 74.5%, 30.33%, and 14.1%, respectively. Compared with LibD, XtraLibD achieves a response time improvement of 61.37% and a storage reduction of 87.93% (99.85% over JIngredient). Our program artifacts are available at https://www.doi.org/10.5281/zenodo.5179747.en_US
dc.language.isoen_USen_US
dc.subjectCode similarityen_US
dc.subjectObfuscationen_US
dc.subjectParagraph vectorsen_US
dc.subjectSoftware bloaten_US
dc.subjectThird-party library detectionen_US
dc.titleXtraLibD: detecting irrelevant third-party libraries in Java and Python applicationsen_US
dc.typeArticleen_US
Appears in Collections:Year-2022

Files in This Item:
File Description SizeFormat 
Full text.pdf2.62 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.