Detecting and Characterising Mobile App Metamorphosis in Google Play Store

A. Mahanti; A. Seneviratne; B. Silva; D. Denipitiyage; K. Gunathilaka; S. Chawla; S. Seneviratne

arXiv: 2407.14565 · v2 · submitted 2024-07-19 · 💻 cs.SE · cs.AI · cs.CV

D. Denipitiyage, B. Silva, K. Gunathilaka, S. Seneviratne, A. Mahanti, A. Seneviratne, S. Chawla

Pith reviewed 2026-05-23 22:47 UTC · model grok-4.3

classification 💻 cs.SE · cs.AI · cs.CV

keywords app metamorphosis · Google Play Store · app re-branding · re-purposing · security risks · privacy risks · multi-modal search · app snapshots

The pith

A multi-modal search on two Google Play Store snapshots five years apart detects apps that undergo major identity or purpose changes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines app metamorphosis as significant shifts in an app's use cases or market positioning that go beyond normal incremental updates. It introduces a multi-modal search method to locate such apps by comparing two full snapshots of the Google Play Store taken five years apart. The approach surfaces distinct patterns including re-births, re-branding, and re-purposing, and assigns a success score showing that some transformed apps outperform average top apps by roughly 11 percent. At the same time the work flags that these changes can conceal security and privacy risks for users.

Core claim

We define this previously unstudied phenomenon as 'app metamorphosis'. In this paper, we propose a novel and efficient multi-modal search methodology to identify apps undergoing metamorphosis and apply it to analyse two snapshots of the Google Play Store taken five years apart. Our methodology uncovers various metamorphosis scenarios, including re-births, re-branding, re-purposing, and others, enabling comprehensive characterisation. Although these transformations may register as successful for app developers based on our defined success score metric (e.g., re-branded apps performing approximately 11.3% better than an average top app), we shed light on the concealed security and privacy risk

What carries the argument

The multi-modal search methodology applied to two snapshots of the Google Play Store five years apart.
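
A minimal sketch of what a cross-snapshot, multi-modal best-match lookup could look like. It is not the authors' implementation: the dictionary keys (`app_id`, `text_vec`, `icon_vec`), the equal weights, and the `0.8` threshold are all hypothetical, and the embeddings are toy vectors rather than outputs of any real text or image model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def best_counterpart(query, candidates, w_text=0.5, w_icon=0.5, threshold=0.8):
    """Return (app_id, score) for the best candidate, or None if no match.

    `query` and each candidate are dicts with the hypothetical keys
    'app_id', 'text_vec' and 'icon_vec'. The weights and threshold are
    illustrative values, not taken from the paper.
    """
    best = None
    for cand in candidates:
        score = (w_text * cosine(query["text_vec"], cand["text_vec"])
                 + w_icon * cosine(query["icon_vec"], cand["icon_vec"]))
        if best is None or score > best[1]:
            best = (cand["app_id"], score)
    # A counterpart may not exist, so weak matches are rejected.
    if best is not None and best[1] >= threshold:
        return best
    return None


# Toy data: one app from the older snapshot, two from the newer one.
old_app = {"app_id": "com.example.old", "text_vec": [1.0, 0.0], "icon_vec": [0.0, 1.0]}
new_apps = [
    {"app_id": "com.example.new", "text_vec": [0.9, 0.1], "icon_vec": [0.1, 0.9]},
    {"app_id": "com.other.app", "text_vec": [0.0, 1.0], "icon_vec": [1.0, 0.0]},
]
match = best_counterpart(old_app, new_apps)
print(match)
```

Returning `None` below the threshold reflects that an app in one snapshot may have no counterpart in the other; where that threshold sits is exactly the kind of choice that drives false positives and misses.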

If this is right

Re-branded apps register approximately 11.3 percent higher on the defined success score than an average top app.
Metamorphosis scenarios such as re-births and re-purposing can be systematically catalogued.
Transformed apps can carry concealed security and privacy risks that affect even tech-savvy users.
Some transformations register as commercially successful for developers despite the underlying changes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

Stores could add version-history flags that alert users when an app's core function has shifted.
The same search technique might be applied to more frequent snapshots to track how often apps change direction.
Reputation resets through re-branding could affect how rating systems should handle app identity over time.

Load-bearing premise

That a multi-modal search across two store snapshots five years apart can reliably locate genuine cases of app metamorphosis without substantial false positives or missed instances.

What would settle it

A manual review of a random sample of flagged apps that finds either many false detections or a large number of actual transformations the method missed.
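
The check described above can be sketched as follows: draw a random sample of flagged apps, record a reviewer verdict for each, and report precision with a confidence interval. The app identifiers, the sample size of 100, and the verdicts are invented for illustration; the Wilson score interval is one common choice for a proportion, not something the paper is known to use. A sample of flagged apps only estimates precision; missed transformations would need a separate sample of unflagged apps.

```python
import math
import random


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


# Hypothetical pool of apps flagged by the search method.
flagged = [f"app_{i}" for i in range(1000)]

# Fixed seed so the sample can be reproduced by a second reviewer.
rng = random.Random(0)
sample = rng.sample(flagged, 100)

# Placeholder verdicts: pretend reviewers confirmed 90 of the 100 cases.
verdicts = [i < 90 for i in range(len(sample))]
confirmed = sum(verdicts)

low, high = wilson_interval(confirmed, len(sample))
print(f"precision = {confirmed / len(sample):.2f}, 95% CI = ({low:.3f}, {high:.3f})")
```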

Figures

Figures reproduced from arXiv: 2407.14565 by A. Mahanti, A. Seneviratne, B. Silva, D. Denipitiyage, K. Gunathilaka, S. Chawla, S. Seneviratne.

**Figure 2.** Creation of validation and test sets from 2018 and 2023 datasets.

**Figure 3.** Overall methodology for query, key, and value based best match (if it exists - as emphasised in the purple colour path) retrieval for counterpart app

**Figure 4.** Majority voting occurrences in each row-wise-position as the most common result. In the figure, each of these selections are visualised in the grey colour box. As shown in

**Figure 5.** Identifiable mappings and regions of interests that are obtainable from our similarity matching algorithm. Area under each pie segment is indicative

**Figure 5.** 116 in purple and 182 in teal). We further filter them

**Figure 6.** Examples for re-birth. Mentioned in italics are the developer name

**Figure 7.** CDF plots for the selected metamorphosis categories. X axis represents the success score (SS %).

**Figure 8.** Examples for re-branding. (*: indicates 2018 version.) The success

**Figure 9.** (a) a special example where an app re-purposed and a re-birth occurred using a different app ID. (b) an example where an app changed to a different

**Figure 10.** Diagram of 10 most common genre (a) and content rating (b) changes

**Figure 11.** Some examples of apps where the target demography changed based

**Figure 12.** Progressive version examples for two apps,

**Figure 13.** Security Risks of Mobile App Metamorphosis. Outlined in red are

**Figure 14.** Percentage change of permissions according to the risk category

Original abstract

App markets have evolved into highly competitive and dynamic environments for developers. While the traditional app life cycle involves incremental updates for feature enhancements and issue resolution, some apps deviate from this norm by undergoing significant transformations in their use cases or market positioning. We define this previously unstudied phenomenon as 'app metamorphosis'. In this paper, we propose a novel and efficient multi-modal search methodology to identify apps undergoing metamorphosis and apply it to analyse two snapshots of the Google Play Store taken five years apart. Our methodology uncovers various metamorphosis scenarios, including re-births, re-branding, re-purposing, and others, enabling comprehensive characterisation. Although these transformations may register as successful for app developers based on our defined success score metric (e.g., re-branded apps performing approximately 11.3% better than an average top app), we shed light on the concealed security and privacy risks that lurk within, potentially impacting even tech-savvy end-users.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note

Private letter to a colleague

The paper names 'app metamorphosis' for major app re-branding or re-purposing and applies multi-modal matching across two Play Store snapshots, but the matching step has no reported validation, so the counts and the 11.3% figure rest on untested assumptions.

The core contribution is straightforward: they notice that some apps do not just update but shift their entire use case or identity, call it metamorphosis, and build a search that combines name, description, icons and metadata to pull examples from snapshots five years apart. That definition and the multi-modal approach on real store data are new. The work also gives concrete scenarios (re-births, re-branding, re-purposing) and a simple success score that puts re-branded apps roughly 11 percent ahead of average top apps. Those pieces are useful for anyone who tracks how apps actually evolve in the market rather than assuming steady incremental change. The security and privacy angle is mentioned but stays high-level.

The soft spot is the matching itself. The abstract and summary give no precision or recall numbers, no hand-labeled test set, and no discussion of how they handle name collisions, developer hand-offs, or gradual description drift. Without that, it is hard to know whether the reported scenario counts are inflated by false positives or whether the 11.3 percent edge survives stricter filtering. The success score is also a free parameter whose exact construction is not shown here. If the full paper still omits a validation section, the quantitative claims become difficult to rely on.

This is the kind of paper that belongs in a software engineering or app-ecosystem venue. Readers who study store dynamics or mobile security will find the framing and the data snapshots worth seeing, even if they end up re-running the matching with their own checks. It is coherent on its own terms and engages the literature enough to deserve referee time rather than a desk reject. I would send it out for review so the method details can be examined directly.

Referee Report

2 major / 1 minor

Summary. The paper defines 'app metamorphosis' as significant transformations in mobile apps' use cases or market positioning on the Google Play Store. It proposes a multi-modal search methodology (name + description + icons + metadata) applied to two snapshots five years apart to detect and characterize scenarios including re-births, re-branding, and re-purposing. A success score metric is defined, with the claim that re-branded apps perform approximately 11.3% better than an average top app, while also highlighting concealed security and privacy risks.

Significance. If the detection methodology proves reliable through validation, the work offers a novel empirical lens on app evolution dynamics beyond incremental updates, with potential value for market analysis and security research. The use of real store snapshots and a quantitative success metric provides concrete characterization, though the absence of reported validation metrics limits the strength of the central claims.

major comments (2)

[multi-modal search methodology] The multi-modal search methodology (as described in the abstract) lacks any reported precision, recall, or validation against a labeled ground-truth set. Without this, the identification of true metamorphosis instances between the five-year snapshots risks substantial false positives from name collisions, developer changes, or description drift, directly undermining the reported scenario counts and the 11.3% success-score advantage.
[success score metric] The success score metric is presented as central to evaluating transformation outcomes (e.g., the 11.3% figure for re-branded apps), yet its exact definition, parameters, and computation are not specified. This free parameter makes it impossible to assess whether the performance claims are robust or sensitive to unstated choices in data handling or thresholding.
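
To make the second comment concrete, here is a toy sketch of why an unspecified composite score is hard to assess. The formula below is entirely hypothetical (a weighted sum of changes in installs and rating relative to a baseline) and is not the paper's success score; the apps, baseline, and weights are invented. It only shows that the ordering of two apps can flip when the weights change.

```python
def toy_success_score(app, baseline, w_installs, w_rating):
    """Hypothetical composite score, in percent, relative to a baseline app.

    This is an illustrative stand-in, not the metric defined in the paper.
    """
    rel_installs = (app["installs"] - baseline["installs"]) / baseline["installs"]
    rel_rating = (app["rating"] - baseline["rating"]) / baseline["rating"]
    return 100.0 * (w_installs * rel_installs + w_rating * rel_rating)


# Invented baseline ("average top app") and two invented transformed apps.
baseline = {"installs": 1_000_000, "rating": 4.0}
app_a = {"installs": 1_500_000, "rating": 3.6}  # more installs, lower rating
app_b = {"installs": 1_000_000, "rating": 4.4}  # same installs, higher rating

for w_installs, w_rating in [(0.5, 0.5), (0.1, 0.9)]:
    score_a = toy_success_score(app_a, baseline, w_installs, w_rating)
    score_b = toy_success_score(app_b, baseline, w_installs, w_rating)
    leader = "A" if score_a > score_b else "B"
    print(f"weights=({w_installs}, {w_rating}): A={score_a:.1f}, B={score_b:.1f}, leader={leader}")
```

With equal weights the install-heavy app leads; with rating-heavy weights the order reverses. A reported advantage of a given size is only interpretable once the formula and weights are stated.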

minor comments (1)

[Abstract] The abstract would benefit from a brief statement on how the two snapshots were obtained and any filtering rules applied to the data.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback, which highlights important areas for strengthening the presentation of our methodology and metrics. We respond to each major comment below and will revise the manuscript accordingly.

Referee: The multi-modal search methodology (as described in the abstract) lacks any reported precision, recall, or validation against a labeled ground-truth set. Without this, the identification of true metamorphosis instances between the five-year snapshots risks substantial false positives from name collisions, developer changes, or description drift, directly undermining the reported scenario counts and the 11.3% success-score advantage.

Authors: We agree that explicit validation metrics would improve the strength of the claims. The multi-modal design (requiring consistency across name, description, icons, and metadata) was intended to reduce false positives from single-modality issues such as name collisions, but the manuscript does not include quantitative precision/recall or a formal ground-truth evaluation. In the revision we will add a new subsection describing a manual validation performed on a random sample of detected cases (reporting inter-rater agreement and estimated precision), together with an explicit discussion of remaining limitations and false-positive risks.

Revision: yes

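
Inter-rater agreement of the kind mentioned in this response is often summarised with Cohen's kappa. The sketch below assumes two reviewers who each give a binary verdict (1 = genuine metamorphosis, 0 = false detection) on the same sampled apps; the labels are invented, and nothing here indicates which agreement statistic the authors would actually report.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Invented verdicts from two hypothetical reviewers on ten sampled apps.
rater_a = [1, 1, 1, 1, 0, 0, 1, 1, 0, 1]
rater_b = [1, 1, 1, 0, 0, 0, 1, 1, 1, 1]

kappa = cohens_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.3f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which matters when most sampled cases fall into one class.
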
Referee: The success score metric is presented as central to evaluating transformation outcomes (e.g., the 11.3% figure for re-branded apps), yet its exact definition, parameters, and computation are not specified. This free parameter makes it impossible to assess whether the performance claims are robust or sensitive to unstated choices in data handling or thresholding.

Authors: We accept that the success score must be fully specified for reproducibility. The metric aggregates normalized changes in ranking, downloads, and ratings relative to category averages, but the manuscript omits the precise formula, weights, and thresholding steps. In the revised version we will insert the complete mathematical definition, all parameter values, and the exact computation that yields the reported 11.3% figure, enabling readers to perform sensitivity checks.

Revision: yes

Circularity Check

0 steps flagged

No circularity: empirical snapshot comparison with defined metrics

The paper defines 'app metamorphosis' as a new phenomenon, proposes a multi-modal search method, applies it to two Google Play snapshots five years apart, and defines a success score metric to quantify outcomes such as the reported 11.3% advantage for re-branded apps. These are operational definitions and direct empirical measurements on observed data; no equations, parameters, or claims reduce by construction to their own inputs, no self-citation chains are load-bearing, and no ansatzes or uniqueness theorems are invoked. The central results rest on the external store data rather than internal redefinition or fitting.

Axiom & Free-Parameter Ledger

1 free parameters · 0 axioms · 0 invented entities

The success score metric is a constructed evaluation tool whose exact formulation is not detailed in the abstract; the definition of metamorphosis itself is a new conceptual framing rather than a derived quantity.

free parameters (1)

success score metric
Defined within the paper to quantify transformation success, with an example result of 11.3% better performance for re-branded apps.

[1] [1]

Number of available applications in the google play store from december 2009 to march 2023,

Statista Inc., “Number of available applications in the google play store from december 2009 to march 2023,” 2023. Accessed on May, 2023

work page 2009

[2] [2]

Number of apps available in leading app stores as of 3rd quarter 2022,

Statista Inc., “Number of apps available in leading app stores as of 3rd quarter 2022,” 2022. Accessed on May, 2023

work page 2022

[3] [3]

Short video service musical.ly is merging into sister app tiktok,

TechCrunch, “Short video service musical.ly is merging into sister app tiktok,” 2018. Accessed on May, 2023

work page 2018

[4] [4]

A measurement study of google play,

N. Viennot, E. Garcia, and J. Nieh, “A measurement study of google play,” in The 2014 ACM international conference on Measurement and modeling of computer systems , pp. 221–233, 2014

work page 2014

[5] [5]

Beyond google play: A large-scale comparative study of chinese android app markets,

H. Wang, Z. Liu, J. Liang, N. Vallina-Rodriguez, Y . Guo, L. Li, J. Tapiador, J. Cao, and G. Xu, “Beyond google play: A large-scale comparative study of chinese android app markets,” in Proceedings of the Internet Measurement Conference 2018 , pp. 293–307, 2018

work page 2018

[6] [6]

Following devil’s footprints: Cross-platform analysis of potentially harmful libraries on android and ios,

K. Chen, X. Wang, Y . Chen, P. Wang, Y . Lee, X. Wang, B. Ma, A. Wang, Y . Zhang, and W. Zou, “Following devil’s footprints: Cross-platform analysis of potentially harmful libraries on android and ios,” in 2016 IEEE Symposium on Security and Privacy (SP) , pp. 357–376, IEEE, 2016

work page 2016

[7] [7]

Proactive libraries: enforcing correct behaviors in android apps,

O. Riganelli, I. D. Fagadau, D. Micucci, and L. Mariani, “Proactive libraries: enforcing correct behaviors in android apps,” in Proceedings of the ACM/IEEE 44th International Conference on Software Engineering: Companion Proceedings, pp. 159–163, 2022

work page 2022

[8] [8]

Why an android app is classified as malware: Toward malware classifi- cation interpretation,

B. Wu, S. Chen, C. Gao, L. Fan, Y . Liu, W. Wen, and M. R. Lyu, “Why an android app is classified as malware: Toward malware classifi- cation interpretation,” ACM Transactions on Software Engineering and Methodology (TOSEM), vol. 30, no. 2, pp. 1–29, 2021

work page 2021

[9] [9]

Crowdroid: behavior- based malware detection system for android,

I. Burguera, U. Zurutuza, and S. Nadjm-Tehrani, “Crowdroid: behavior- based malware detection system for android,” in Proceedings of the 1st ACM workshop on Security and privacy in smartphones and mobile devices, pp. 15–26, 2011. 14

work page 2011

[10] [10]

Riskranker: scalable and accurate zero-day android malware detection,

M. Grace, Y. Zhou, Q. Zhang, S. Zou, and X. Jiang, “Riskranker: scalable and accurate zero-day android malware detection,” in Proceedings of the 10th international conference on Mobile systems, applications, and services, pp. 281–294, 2012

work page 2012

[11] [11]

“andromaly”: a behavioral malware detection framework for android devices,

A. Shabtai, U. Kanonov, Y. Elovici, C. Glezer, and Y. Weiss, ““andromaly”: a behavioral malware detection framework for android devices,” Journal of Intelligent Information Systems, vol. 38, no. 1, pp. 161–190, 2012

work page 2012

[12] [12]

Droidmat: Android malware detection through manifest and api calls tracing,

D.-J. Wu, C.-H. Mao, T.-E. Wei, H.-M. Lee, and K.-P. Wu, “Droidmat: Android malware detection through manifest and api calls tracing,” in 2012 Seventh Asia joint conference on information security, pp. 62–69, IEEE, 2012

work page 2012

[13] [13]

Droid-sec: deep learning in android malware detection,

Z. Yuan, Y. Lu, Z. Wang, and Y. Xue, “Droid-sec: deep learning in android malware detection,” in Proceedings of the 2014 ACM conference on SIGCOMM, pp. 371–372, 2014

work page 2014

[14] [14]

An analysis of the privacy and security risks of android vpn permission-enabled apps,

M. Ikram, N. Vallina-Rodriguez, S. Seneviratne, M. A. Kaafar, and V. Paxson, “An analysis of the privacy and security risks of android vpn permission-enabled apps,” in Proceedings of the 2016 internet measurement conference, pp. 349–364, 2016

work page 2016

[15] [15]

Share first, ask later (or never?)-studying violations of gdpr’s explicit consent in android apps,

T. T. Nguyen, M. Backes, N. Marnau, and B. Stock, “Share first, ask later (or never?)-studying violations of gdpr’s explicit consent in android apps,” in USENIX Security Symposium, 2021

work page 2021

[16] [16]

Mixed signals: Analyzing software attribution challenges in the android ecosystem,

K. Hageman, Á. Feal, J. Gamba, A. Girish, J. Bleier, M. Lindorfer, J. Tapiador, and N. Vallina-Rodriguez, “Mixed signals: Analyzing software attribution challenges in the android ecosystem,” IEEE Transactions on Software Engineering, 2023

work page 2023

[17] [17]

Early detection of spam mobile apps,

S. Seneviratne, A. Seneviratne, M. A. Kaafar, A. Mahanti, and P. Mohapatra, “Early detection of spam mobile apps,” in Proceedings of the 24th International Conference on World Wide Web, pp. 949–959, 2015

work page 2015

[18] [18]

A longitudinal study of google play,

R. Potharaju, M. Rahman, and B. Carbunar, “A longitudinal study of google play,” IEEE Transactions on computational social systems, vol. 4, no. 3, pp. 135–149, 2017

work page 2017

[19] [19]

Update behavior in app markets and security implications: A case study in google play,

A. Möller, F. Michahelles, S. Diewald, L. Roalter, and M. Kranz, “Update behavior in app markets and security implications: A case study in google play,” in Research in the Large, LARGE 3.0: 21/09/2012-21/09/2012, pp. 3–6, 2012

work page 2012

[20] [20]

To update or not to update: Insights from a two-year study of android app evolution,

V. F. Taylor and I. Martinovic, “To update or not to update: Insights from a two-year study of android app evolution,” in Proceedings of the 2017 ACM on Asia Conference on Computer and Communications Security, pp. 45–57, 2017

work page 2017

[21] [21]

A longitudinal study of popular ad libraries in the google play store,

M. Ahasanuzzaman, S. Hassan, C.-P. Bezemer, and A. E. Hassan, “A longitudinal study of popular ad libraries in the google play store,” Empirical Software Engineering, vol. 25, pp. 824–858, 2020

work page 2020

[22] [22]

Understanding the evolution of mobile app ecosystems: A longitudinal measurement study of google play,

H. Wang, H. Li, and Y. Guo, “Understanding the evolution of mobile app ecosystems: A longitudinal measurement study of google play,” in The World Wide Web Conference, pp. 1988–1999, 2019

work page 2019

[23] [23]

Understanding the long-term evolution of mobile app usage,

T. Li, Y. Fan, Y. Li, S. Tarkoma, and P. Hui, “Understanding the long-term evolution of mobile app usage,” IEEE Transactions on Mobile Computing, 2021

work page 2021

[24] [24]

Decline in mobile application life cycle,

A. Vagrani, N. Kumar, and P. V. Ilavarasan, “Decline in mobile application life cycle,” Procedia Computer Science, vol. 122, pp. 957–964, 2017

work page 2017

[25] [25]

Adrob: Examining the landscape and impact of android application plagiarism,

C. Gibler, R. Stevens, J. Crussell, H. Chen, H. Zang, and H. Choi, “Adrob: Examining the landscape and impact of android application plagiarism,” in Proceeding of the 11th annual international conference on Mobile systems, applications, and services, pp. 431–444, 2013

work page 2013

[26] [26]

Viewdroid: Towards obfuscation-resilient mobile application repackaging detection,

F. Zhang, H. Huang, S. Zhu, D. Wu, and P. Liu, “Viewdroid: Towards obfuscation-resilient mobile application repackaging detection,” in Proceedings of the 2014 ACM conference on Security and privacy in wireless & mobile networks, pp. 25–36, 2014

work page 2014

[27] [27]

Android application forensics: A survey of obfuscation, obfuscation detection and deobfuscation techniques and their impact on investigations,

X. Zhang, F. Breitinger, E. Luechinger, and S. O’Shaughnessy, “Android application forensics: A survey of obfuscation, obfuscation detection and deobfuscation techniques and their impact on investigations,” Forensic Science International: Digital Investigation, vol. 39, p. 301285, 2021

work page 2021

[28] [28]

A multi-modal neural embeddings approach for detecting mobile counterfeit apps: A case study on google play store,

N. Karunanayake, J. Rajasegaran, A. Gunathillake, S. Seneviratne, and G. Jourjon, “A multi-modal neural embeddings approach for detecting mobile counterfeit apps: A case study on google play store,” IEEE Transactions on Mobile Computing, vol. 21, no. 1, pp. 16–30, 2020

work page 2020

[29] [30]

Spam mobile apps: Characteristics, detection, and in the wild analysis,

S. Seneviratne, A. Seneviratne, M. A. Kaafar, A. Mahanti, and P. Mohapatra, “Spam mobile apps: Characteristics, detection, and in the wild analysis,” ACM Transactions on the Web (TWEB), vol. 11, no. 1, pp. 1–29, 2017

work page 2017

[30] [31]

Stytr2: Image style transfer with transformers,

Y. Deng, F. Tang, W. Dong, C. Ma, X. Pan, L. Wang, and C. Xu, “Stytr2: Image style transfer with transformers,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11326–11336, 2022

work page 2022

[31] [32]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., “An image is worth 16x16 words: Transformers for image recognition at scale,” arXiv preprint arXiv:2010.11929, 2020

work page arXiv 2020

[32] [33]

An exploration of encoder-decoder approaches to multi-label classification for legal and biomedical text,

Y. Kementchedjhieva and I. Chalkidis, “An exploration of encoder-decoder approaches to multi-label classification for legal and biomedical text,” arXiv preprint arXiv:2305.05627, 2023

work page arXiv 2023

[33] [34]

Mpnet: Masked and permuted pre-training for language understanding,

K. Song, X. Tan, T. Qin, J. Lu, and T.-Y. Liu, “Mpnet: Masked and permuted pre-training for language understanding,” Advances in Neural Information Processing Systems, vol. 33, pp. 16857–16867, 2020

work page 2020

[34] [35]

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “Bert: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprint arXiv:1810.04805, 2018

work page arXiv 2018

[35] [36]

Xlnet: Generalized autoregressive pretraining for language understanding,

Z. Yang, Z. Dai, Y. Yang, J. Carbonell, R. R. Salakhutdinov, and Q. V. Le, “Xlnet: Generalized autoregressive pretraining for language understanding,” Advances in neural information processing systems, vol. 32, 2019

work page 2019

[36] [37]

Billion-scale similarity search with gpus,

J. Johnson, M. Douze, and H. Jégou, “Billion-scale similarity search with gpus,” IEEE Transactions on Big Data, vol. 7, no. 3, pp. 535–547, 2019

work page 2019

[37] [38]

Bayesian Optimization: Open source constrained global optimization tool for Python,

F. Nogueira, “Bayesian Optimization: Open source constrained global optimization tool for Python,” 2014–

work page 2014

[38] [39]

Android’s latest statistics 2024: How many people have androids?,

Sam Nguyen, “Android’s latest statistics 2024: How many people have androids?,” September 06, 2023

work page 2024

[39] [40]

Android - statistics & facts,

Ahmed Sherif, “Android - statistics & facts,” January 10, 2024

work page 2024

[40] [41]

Insights into the 2.3 billion android smartphones in use around the world,

“Insights into the 2.3 billion android smartphones in use around the world,” January 13, 2018

work page 2018

[41] [42]

Uno (video game),

“Uno (video game),” 2022. Accessed on May, 2023

work page 2022

[42] [43]

Flickr,

“Flickr,” 2022. Accessed on May, 2023

work page 2022

[43] [44]

App miscategorization detection: A case study on google play,

D. Surian, S. Seneviratne, A. Seneviratne, and S. Chawla, “App miscategorization detection: A case study on google play,” IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 8, pp. 1591–1604, 2017

work page 2017

[44] [45]

Transfer app to a different developer account,

“Transfer app to a different developer account,” 2023. Accessed on May, 2023

work page 2023

[45] [46]

Kiloo - subway surfers wiki

Subway Surfers Wiki, “Kiloo - subway surfers wiki.” Accessed on May, 2023

work page 2023

[46] [47]

Uraniborg’s device preloaded app risks scoring metrics,

B. Lau, J. Zhang, A. R. Beresford, D. Thomas, and R. Mayrhofer, “Uraniborg’s device preloaded app risks scoring metrics,” Institute of Networks and Security: Linz, Austria, 2020.

Dishanika Denipitiyage received her Bachelor’s degree in Electronic and Telecommunication Engineering from University of Moratuwa, Sri Lanka in 2020. She is currently working ...

work page 2020

His current research interests include privacy and security in mobile systems, AI applications in security, and behavior biometrics. Before moving into research, he worked nearly six years in the telecommunications industry in core network planning and operations. He received his bachelor’s degree from University of Moratuwa, Sri Lanka in 2005. Anirban Ma...


He received his PhD from the University of Tennessee (USA) in 1995. His research is in data mining and machine learning with a specialization in spatio-temporal data mining, outlier detection, class imbalanced classification, and adversarial learning
