Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms

PDF Version Also Available for Download.

Description

This research methodology isolates coding properties and identifies the probability of security vulnerabilities using machine learning and historical data. Several approaches characterize the effectiveness of detecting security-related bugs that manifest as vulnerabilities, but none utilize vulnerability patch information. The main contribution of this research is a framework to analyze LLVM Intermediate Representation Code and merging core source code representations using source code properties. This research is beneficial because it allows source programs to be transformed into a graphical form and users can extract specific code properties related to vulnerable functions. The result is an improved approach to detect, identify, and … continued below

Physical Description

x, 120 pages

Creation Information

Mayo, Quentin R December 2018.

Context

This dissertation is part of the collection entitled: UNT Theses and Dissertations and was provided by the UNT Libraries to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 42 times. More information about this dissertation can be viewed below.

Author

Mayo, Quentin R

Chairs

Bryce, Renee Major Professor
Dantu, Ram Co-Major Professor

Committee Members

Publisher

University of North Texas
Publisher Info: www.unt.edu

Place of Publication: Denton, Texas

Rights Holder

For guidance see Citations, Rights, Re-Use.

Mayo, Quentin R

Provided By

UNT Libraries

The UNT Libraries serve the university and community by providing access to physical and online collections, fostering information literacy, supporting academic research, and much, much more.

Degree Information

Name: Doctor of Philosophy
Level: Doctoral
Department: Department of Computer Science and Engineering
College: College of Engineering
Discipline: Computer Science and Engineering
PublicationType: Doctoral Dissertation
Grantor: University of North Texas

Description

This research methodology isolates coding properties and identifies the probability of security vulnerabilities using machine learning and historical data. Several approaches characterize the effectiveness of detecting security-related bugs that manifest as vulnerabilities, but none utilize vulnerability patch information. The main contribution of this research is a framework to analyze LLVM Intermediate Representation Code and merging core source code representations using source code properties. This research is beneficial because it allows source programs to be transformed into a graphical form and users can extract specific code properties related to vulnerable functions. The result is an improved approach to detect, identify, and track software system vulnerabilities based on a performance evaluation. The methodology uses historical function level vulnerability information, unique feature extraction techniques, a novel code property graph, and learning algorithms to minimize the amount of end user domain knowledge necessary to detect vulnerabilities in applications. The analysis shows approximately 99% precision and recall to detect known vulnerabilities in the National Institute of Standards and Technology (NIST) Software Assurance Metrics and Tool Evaluation (SAMATE) project. Furthermore, 72% percent of the historical vulnerabilities in the OpenSSL testing environment were detected using a linear support vector classifier (SVC) model.

Physical Description

x, 120 pages

Subjects

Keywords

Library of Congress Subject Headings

Language

English

Item Type

Thesis or Dissertation

Identifier

Unique identifying numbers for this dissertation in the Digital Library or other systems.

Accession or Local Control No: submission_1423
Digital Object Identifier: https://doi.org/10.12794/metadc1404548
Archival Resource Key: ark:/67531/metadc1404548

Collections

This dissertation is part of the following collection of related materials.

UNT Theses and Dissertations

Theses and dissertations represent a wealth of scholarly and artistic content created by masters and doctoral students in the degree-seeking process. Some ETDs in this collection are restricted to use by the UNT community.

What responsibilities do I have when using this dissertation?

Creation Date

December 2018

Added to The UNT Digital Library

Jan. 19, 2019, 9:34 p.m.

Description Last Updated

Feb. 14, 2025, 1:57 p.m.

Usage Statistics

When was this dissertation last used?

Yesterday: 0

Past 30 days: 0

Total Uses: 42

Interact With This Dissertation

Here are some suggestions for what to do next.

Start Reading

Thumbnail image of item number 1 in: 'Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms'.

Thumbnail image of item number 2 in: 'Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms'.

Thumbnail image of item number 3 in: 'Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms'.

Thumbnail image of item number 4 in: 'Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms'.

PDF Version Also Available for Download.

All Formats

Citations, Rights, Re-Use

International Image Interoperability Framework

We support the IIIF Presentation API

Links for Robots

Helpful links in machine-readable formats.

Detection of Generalizable Clone Security Coding Bugs Using Graphs and Learning Algorithms

Description

Physical Description

Creation Information

Context

Who

Author

Chairs

Committee Members

Publisher

Rights Holder

Provided By

UNT Libraries

Contact Us

What

Degree Information

Description

Physical Description

Subjects

Keywords

Library of Congress Subject Headings

Language

Item Type

Identifier

Collections

UNT Theses and Dissertations

Digital Files

When

Creation Date

Added to The UNT Digital Library

Description Last Updated

Usage Statistics

Interact With This Dissertation

Search Inside

Start Reading

Citations, Rights, Re-Use

International Image Interoperability Framework

Print / Share

Links for Robots

Archival Resource Key (ARK)

International Image Interoperability Framework (IIIF)

Metadata Formats

Images

URLs

Stats