DeepFuzz: Automatic Generation of Syntax Valid C Programs for Fuzz Testing

Xiao Liu; Xiaoting Li; Rupesh Prajapati; Dinghao Wu

doi:10.1609/aaai.v33i01.33011044

Authors

Xiao Liu Pennsylvania State University
Xiaoting Li Pennsylvania State University
Rupesh Prajapati Pennsylvania State University
Dinghao Wu Pennsylvania State University

DOI:

https://doi.org/10.1609/aaai.v33i01.33011044

Abstract

Compilers are among the most fundamental programming tools for building software. However, production compilers remain buggy. Fuzz testing is often leveraged with newlygenerated, or mutated inputs in order to find new bugs or security vulnerabilities. In this paper, we propose a grammarbased fuzzing tool called DEEPFUZZ. Based on a generative Sequence-to-Sequence model, DEEPFUZZ automatically and continuously generates well-formed C programs. We use this set of new C programs to fuzz off-the-shelf C compilers, e.g., GCC and Clang/LLVM. We present a detailed case study to analyze the success rate and coverage improvement of the generated C programs for fuzz testing. We analyze the performance of DEEPFUZZ with three types of sampling methods as well as three types of generation strategies. Consequently, DEEPFUZZ improved the testing efficacy in regards to the line, function, and branch coverage. In our preliminary study, we found and reported 8 bugs of GCC, all of which are actively being addressed by developers.

DeepFuzz: Automatic Generation of Syntax Valid C Programs for Fuzz Testing

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription