nPrint Project Datasets

Traffic analysis datasets encoded using pcapML

Overview

The nPrint Project has released datasets in an effort to encourage more reproducible network traffic analysis. These datasets were used in the original nPrint research paper and are encoded using pcapML.

Access

Download: Google Drive

Each dataset includes:

  • Usage information
  • Original citation details
  • pcapML encoding

License

Distributed under Apache 2.0 License

Documentation

Citation

If you use these datasets, please cite:

@inproceedings{holland2021new,
  author = {Holland, Jordan and Schmitt, Paul and Feamster, Nick and Mittal, Prateek},
  title = {New Directions in Automated Traffic Analysis},
  year = {2021},
  isbn = {9781450384544},
  publisher = {Association for Computing Machinery},
  doi = {10.1145/3460120.3484758},
  booktitle = {Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security},
  pages = {3366–3383}
}