Automated static code analysis for classifying android applications using machine learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

136 Scopus citations

Abstract

In this paper we apply Machine Learning (ML) techniques on static features that are extracted from Android's application files for the classification of the files. Features are extracted from Android's Java byte-code (i.e., .dex files) and other file types such as XML-files. Our evaluation focused on classifying two types of Android applications: tools and games. Successful differentiation between games and tools is expected to provide positive indication about the ability of such methods to learn and model Android benign applications and potentially detect malware files. The results of an evaluation, performed using a test collection comprising 2,285 Android .apk files, indicate that features, extracted statically from .apk files, coupled with ML classification algorithms can provide good indication about the nature of an Android application without running the application, and may assist in detecting malicious applications. This method can be used for rapid examination of Android .apks and informing of suspicious applications.

Original languageEnglish
Title of host publicationProceedings - 2010 International Conference on Computational Intelligence and Security, CIS 2010
Pages329-333
Number of pages5
DOIs
StatePublished - 1 Dec 2010
Event2010 International Conference on Computational Intelligence and Security, CIS 2010 - Nanning, China
Duration: 11 Dec 201014 Dec 2010

Publication series

NameProceedings - 2010 International Conference on Computational Intelligence and Security, CIS 2010

Conference

Conference2010 International Conference on Computational Intelligence and Security, CIS 2010
Country/TerritoryChina
CityNanning
Period11/12/1014/12/10

Keywords

  • Android
  • Machine learning
  • Malware
  • Mobile devices
  • Security
  • Static analysis

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Automated static code analysis for classifying android applications using machine learning'. Together they form a unique fingerprint.

Cite this