Deep Learning vs. Traditional Probabilistic Models: Case Study on Short Inputs for Password Guessing

  • Yuan Linghu
  • , Xiangxue Li*
  • , Zhenlong Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

The paper focuses on the comparative analysis of deep learning algorithms and traditional probabilistic models on strings of short lengths (typically, passwords). The password is one of the dominant methods used in user authentication. Compared to the traditional brute-force attack and dictionary attack, password guessing models use the leaked password datasets to generate password guesses, expecting to cover as many accounts as possible while minimizing the number of guesses. In this paper, we analyze the password pattern of leaked datasets and further present a comparative study on two dominant probabilistic models (i.e., Markov-based model and Probabilistic Context-Free Grammars (PCFG) based model) and the PassGAN model (which is a representative deep-learning-based method). We use Laplace smoothing for the Markov model and introduce particular semantic patterns to the PCFG model. Our output shows that the Markov-based models can cover the vast majority of the passwords in the test set and PassGAN demonstrates surprisingly the worst effect. Nevertheless, considering the threat that an attacker may adjust the training set, the PCFG model is better than the Markov model. Using Passcode with high-frequency passwords can increase the coverage while reducing the number of guesses. Brute-force attack can also work better when used in conjunction with probabilistic models. For the same billion guesses, brute-force attack can be used to crack pure digital passwords of 4 to 8 lengths, and original-PCFG and modified-PCFG could increase by 11.16% and 8.69%, respectively.

Original languageEnglish
Title of host publicationAlgorithms and Architectures for Parallel Processing - 19th International Conference, ICA3PP 2019, Proceedings
EditorsSheng Wen, Albert Zomaya, Laurence T. Yang
PublisherSpringer
Pages468-483
Number of pages16
ISBN (Print)9783030389901
DOIs
StatePublished - 2020
Event19th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2019 - Melbourne, Australia
Duration: 9 Dec 201911 Dec 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11944 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2019
Country/TerritoryAustralia
CityMelbourne
Period9/12/1911/12/19

Keywords

  • Deep learning
  • Markov model
  • PCFG
  • PassGAN
  • Password guessing

Fingerprint

Dive into the research topics of 'Deep Learning vs. Traditional Probabilistic Models: Case Study on Short Inputs for Password Guessing'. Together they form a unique fingerprint.

Cite this