Skip to main content
Article
Can We Learn What People Are Doing From Raw DNS Queries?
Proceedings - IEEE INFOCOM
  • Jianfeng Li
  • Xiaobo Ma
  • Li Guodong
  • Xiapu Luo
  • Junjie Zhang, Wright State University - Main Campus
  • Wei Li
Document Type
Conference Proceeding
Publication Date
10-8-2018
Disciplines
Abstract

Domain Name System (DNS) is one of the pillars of today's Internet. Due to its appealing properties such as low data volume, wide-ranging applications and encryption free, DNS traffic has been extensively utilized for network monitoring. Most existing studies of DNS traffic, however, focus on domain name reputation. Little attention has been paid to understanding and profiling what people are doing from DNS traffic, a fundamental problem in the areas including Internet demographics and network behavior analysis. Consequently, simple questions like 'How to determine whether a DNS query for www.google.com means searching or any other behaviors?' cannot be answered by existing studies. In this paper, we take the first step to identify user activities from raw DNS queries. We advance a multiscale hierarchical framework to tackle two practical challenges, i.e., behavior ambiguity and behavior polymorphism. Under this framework, a series of novel methods, such as pattern upward mapping and multi-scale random forest classifier, are proposed to characterize and identify user activities of interest. Evaluation using both synthetic and real-world DNS traces demonstrates the effectiveness of our method.

DOI
10.1109/INFOCOM.2018.8486210
Citation Information
Jianfeng Li, Xiaobo Ma, Li Guodong, Xiapu Luo, et al.. "Can We Learn What People Are Doing From Raw DNS Queries?" Proceedings - IEEE INFOCOM (2018) p. 2240 - 2248 ISSN: 0743166X
Available at: http://works.bepress.com/junjie_zhang/17/