Supervised Learning for Provenance-Similarity of Binaries

By Teddy Rogers

https://forum.tuts4you.com/files/file/1639-supervised-learning-for-provenance-similarity-of-binaries/

About This File

Understanding, measuring, and leveraging the similarity of binaries (executable code) is a foundational challenge in software engineering. We present a notion of similarity based on provenance: two binaries are similar if they are compiled from the same (or very similar) source code with the same (or similar) compilers. Empirical evidence suggests that provenance-similarity accounts for a significant portion of variation in existing binaries, particularly in malware. We propose and evaluate the applicability of classification to detect provenance-similarity. We evaluate a variety of classifiers, and different types of attributes and similarity labeling schemes, on two benchmarks derived from open-source software and malware, respectively. We present encouraging results indicating that classification is a viable approach for automated provenance-similarity detection, and as an aid for malware analysts in particular.
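
As a rough illustration of what "classification to detect provenance-similarity" can look like, the sketch below treats each pair of code fragments as one training example, derives a few numeric attributes from the pair, and fits a small logistic-regression classifier. Everything in it is an assumption made for illustration: the abstract does not say which attributes, classifiers, or labeling schemes the authors used, so the opcode n-gram attributes, the toy opcode sequences, and the helper names (`pair_features`, `train_logistic`, `predict`) are hypothetical and are not the paper's method.

```python
import math

def ngrams(seq, n):
    """Return the set of n-grams (as tuples) found in an opcode sequence."""
    return {tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)}

def jaccard(a, b):
    """Jaccard similarity of two sets; taken to be 1.0 when both are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

def pair_features(x, y):
    """Attribute vector for a PAIR of opcode sequences (hypothetical attributes)."""
    return [
        jaccard(ngrams(x, 1), ngrams(y, 1)),        # shared opcodes
        jaccard(ngrams(x, 2), ngrams(y, 2)),        # shared opcode bigrams
        min(len(x), len(y)) / max(len(x), len(y)),  # length ratio
    ]

def train_logistic(rows, labels, epochs=2000, lr=0.5):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    w = [0.0] * len(rows[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for feats, label in zip(rows, labels):
            z = sum(wi * fi for wi, fi in zip(w, feats)) + b
            err = 1.0 / (1.0 + math.exp(-z)) - label
            for j, fj in enumerate(feats):
                grad_w[j] += err * fj
            grad_b += err
        w = [wi - lr * g / len(rows) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / len(rows)
    return w, b

def predict(model, x, y):
    """Return 1 if the pair is classified as provenance-similar, else 0."""
    w, b = model
    z = sum(wi * fi for wi, fi in zip(w, pair_features(x, y))) + b
    return 1 if z > 0 else 0

# Toy opcode sequences (invented). f_a/f_a2 and f_b/f_b2 stand in for the
# same source built with slightly different compiler settings.
f_a = ["push", "mov", "sub", "call", "add", "pop", "ret"]
f_a2 = ["push", "mov", "sub", "call", "add", "leave", "ret"]
f_b = ["xor", "cmp", "jne", "inc", "jmp", "ret"]
f_b2 = ["xor", "cmp", "je", "inc", "jmp", "ret"]
f_c = ["lea", "imul", "shl", "or", "test", "jz", "ret"]
f_c2 = ["lea", "imul", "shl", "or", "test", "jnz", "ret"]  # held out

# Labeled pairs: 1 = provenance-similar, 0 = not.
training_pairs = [
    (f_a, f_a2, 1),
    (f_b, f_b2, 1),
    (f_a, f_b, 0),
    (f_a2, f_c, 0),
    (f_b, f_c, 0),
]
rows = [pair_features(x, y) for x, y, _ in training_pairs]
labels = [label for _, _, label in training_pairs]
model = train_logistic(rows, labels)

print(predict(model, f_c, f_c2))  # pair not seen during training
print(predict(model, f_a, f_c))   # unrelated fragments
```

The point of the sketch is the framing rather than the specific model: similarity is learned from labeled pairs instead of being fixed by a hand-picked distance threshold, so the logistic regression could be swapped for any other classifier over the same pair attributes.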
