StreamSpot Data

http://www3.cs.stonybrook.edu/~emanzoor/streamspot/

This repository contains the ALL dataset, which includes the edges from all 600 benign and attack scenario graphs. The YDC and GFC datasets can be derived from ALL by selecting the graph IDs that belong to the following scenarios:

YDC: YouTube, Download, CNN
GFC: GMail, VGame, CNN

Format

Tab-separated file with one edge on each line in the following format:

source-id	source-type	destination-id	destination-type	edge-type	graph-id
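As a minimal sketch of reading this format, the snippet below parses one line into a named tuple. The `Edge` and `parse_edge` names are illustrative and not part of this repository, and the sketch assumes that node and graph IDs are integers and that each line has exactly six tab-separated fields.

```python
from typing import NamedTuple


class Edge(NamedTuple):
    """One edge of a StreamSpot graph (illustrative type, not from the repository)."""
    source_id: int
    source_type: str
    destination_id: int
    destination_type: str
    edge_type: str
    graph_id: int


def parse_edge(line: str) -> Edge:
    """Parse one tab-separated line; assumes IDs are integers."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 6:
        raise ValueError(f"expected 6 tab-separated fields, got {len(fields)}")
    src, src_type, dst, dst_type, edge_type, graph_id = fields
    return Edge(int(src), src_type, int(dst), dst_type, edge_type, int(graph_id))
```

Calling `parse_edge` on each line of the extracted file yields records whose fields can be accessed by name, for example `edge.graph_id`.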

Graph IDs correspond to scenarios as follows:

YouTube (graph IDs 0-99)
GMail (graph IDs 100-199)
VGame (graph IDs 200-299)
Drive-by-download attack (graph IDs 300-399)
Download (graph IDs 400-499)
CNN (graph IDs 500-599)
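Because each scenario occupies a block of 100 consecutive graph IDs, the scenario can be recovered with integer division. The sketch below uses that to filter lines of ALL down to the scenarios listed for YDC or GFC. The names `scenario_of`, `filter_edges`, `YDC` and `GFC` are illustrative, and the sketch assumes the graph ID is the last tab-separated field on each line.

```python
# Scenario names in graph-ID order; each covers 100 consecutive IDs.
SCENARIOS = [
    "YouTube",
    "GMail",
    "VGame",
    "Drive-by-download attack",
    "Download",
    "CNN",
]

# Scenario sets as listed for the derived datasets.
YDC = {"YouTube", "Download", "CNN"}
GFC = {"GMail", "VGame", "CNN"}


def scenario_of(graph_id: int) -> str:
    """Map a graph ID in 0..599 to its scenario name."""
    if not 0 <= graph_id < 600:
        raise ValueError(f"graph ID out of range: {graph_id}")
    return SCENARIOS[graph_id // 100]


def filter_edges(lines, scenarios):
    """Yield only the lines whose graph ID belongs to one of the given scenarios."""
    for line in lines:
        # The graph ID is assumed to be the last tab-separated field.
        graph_id = int(line.rstrip("\n").rsplit("\t", 1)[1])
        if scenario_of(graph_id) in scenarios:
            yield line
```

For example, passing an open file handle for the extracted ALL file together with `YDC` yields only the edges of the YouTube, Download and CNN graphs.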

Construction

The ALL dataset was extracted from the raw flow-graph data using preprocess.py, which performs the following:

Each node and edge type is mapped to a single character.
Consecutive edges between the same pair of nodes corresponding to block-by-block file reads are collapsed into a single edge.
Node IDs are incremented by 1 (so IDs of -1 become 0).
The timestamp field is removed (raw edges are sorted by timestamp).
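The four steps above can be sketched as follows. This is not the code in preprocess.py: the raw tuple layout, the type names, the character codes and the rule for recognising a repeated read are all assumptions made for illustration.

```python
# Hypothetical type names and single-character codes, for illustration only.
NODE_CODES = {"process": "a", "file": "b"}
EDGE_CODES = {"read": "r", "write": "w"}


def preprocess(raw_edges):
    """Apply the four steps to raw edges already sorted by timestamp.

    Each raw edge is assumed to be a tuple:
    (src, src_type, dst, dst_type, edge_type, timestamp, graph_id)
    """
    out = []
    for src, src_type, dst, dst_type, edge_type, _timestamp, graph_id in raw_edges:
        edge = (
            src + 1,                # shift node IDs so that -1 becomes 0
            NODE_CODES[src_type],   # map node types to single characters
            dst + 1,
            NODE_CODES[dst_type],
            EDGE_CODES[edge_type],  # map edge types to single characters
            graph_id,               # the timestamp is dropped
        )
        # Collapse a read that repeats the immediately preceding edge.
        is_repeat_read = edge_type == "read" and len(out) > 0 and out[-1] == edge
        if not is_repeat_read:
            out.append(edge)
    return out
```

In this sketch only consecutive reads between the same pair of nodes are merged; repeated edges of other types are kept as separate edges.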

preprocess.py is run as: python preprocess.py <raw edges file>

Credits

The raw flow-graph data for StreamSpot was provided by Venkat N. Venkatakrishnan, Sadegh Momeni and team at the University of Illinois - Chicago.
