Detail publikace

Low Overhead Distributed IP Flow Records Collection and Analysis

WRONA, J. ŽÁDNÍK, M.

Originální název

Low Overhead Distributed IP Flow Records Collection and Analysis

Typ

abstrakt

Jazyk

angličtina

Originální abstrakt

Collection and analysis of IP flow records are data-intensive tasks for which the power of a single node may not be sufficient. Several Hadoop-based solutions to this problem exist, but those are usually suitable only for truly big data, otherwise, disadvantages of Hadoop may prevail. In this work, we presented a distributed platform with significantly less overhead, focusing on smaller clusters, preserving interactivity of the centralized system while exploiting the prospects of the distributed system like high availability, parallel processing, scalability or redundancy. Experiments showed great scalability of both storage and query performance. Extensions for data mining and machine learning are easy to include and are already work in progress, moreover, the whole software stack is open-source.

Klíčová slova

NetFlow, IPFIX, IP flow collector, distributed system, parallel computing, Hadoop, big data

Autoři

WRONA, J.; ŽÁDNÍK, M.

Vydáno

21. 8. 2017

Místo

Los Angeles

Strany počet

2

BibTex

@misc{BUT170109,
  author="Jan {Wrona} and Martin {Žádník}",
  title="Low Overhead Distributed IP Flow Records Collection and Analysis",
  booktitle="SIGCOMM '17: Proceedings of the 2017 ACM SIGCOMM Conference",
  year="2017",
  pages="2",
  address="Los Angeles",
  note="abstract"
}