Detecting DNS Tunnels Using Character Frequency Analysis

David Gustafson; Kenton Born

arxiv: 1004.4358 · v1 · submitted 2010-04-25 · 💻 cs.CR

Detecting DNS Tunnels Using Character Frequency Analysis

Kenton Born , David Gustafson This is my paper

classification 💻 cs.CR

keywords charactertunnelsdomainstrafficdetectingdomainfrequenciesfrequency

0 comments

read the original abstract

High-bandwidth covert channels pose significant risks to sensitive and proprietary information inside company networks. Domain Name System (DNS) tunnels provide a means to covertly infiltrate and exfiltrate large amounts of information passed network boundaries. This paper explores the possibility of detecting DNS tunnels by analyzing the unigram, bigram, and trigram character frequencies of domains in DNS queries and responses. It is empirically shown how domains follow Zipf's law in a similar pattern to natural languages, whereas tunneled traffic has more evenly distributed character frequencies. This approach allows tunnels to be detected across multiple domains, whereas previous methods typically concentrate on monitoring point to point systems. Anomalies are quickly discovered when tunneled traffic is compared to the character frequency fingerprint of legitimate domain traffic.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Identifying DNS-tunneled traffic with predictive models
cs.CR 2019-06 unverdicted novelty 3.0

Pairing DNS queries and responses in feature extraction raises MLP and Random Forest accuracy above 83% for detecting SSH/SFTP/Telnet tunnels, with roughly 95% reduction in data size.