Process Mining Google Borg Trace Data

In this use case, we use mindzie studio to process mine Google Borg trace data, with the goal of identifying bottlenecks and improving capacity allocation and scheduling decisions. The data covers 8 different Borg cells. It includes CPU usage, job-to-resource allocation information, and job-parent information for MapReduce jobs. It is fully anonymized and does not contain any user information. The data can be obtained here:

https://github.com/google/cluster-data/blob/master/ClusterData2019.md
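
To give a rough idea of what process mining a scheduler trace can look like, here is a minimal sketch in plain Python. It is not how mindzie studio works internally, and it does not read the real trace files. The event rows, the case identifiers and the activity labels (SUBMIT, SCHEDULE, EVICT, FINISH) are invented for illustration. The sketch groups events by case, builds the directly-follows transitions, and reports the mean time spent on each transition, which is one simple way to spot where work waits the longest.

```python
from collections import defaultdict

# Toy event log: (case_id, activity, timestamp_in_seconds).
# All values are made up for illustration; they are not taken from the trace.
events = [
    ("job1/0", "SUBMIT", 0),
    ("job1/0", "SCHEDULE", 5),
    ("job1/0", "FINISH", 65),
    ("job2/0", "SUBMIT", 10),
    ("job2/0", "SCHEDULE", 130),
    ("job2/0", "EVICT", 150),
    ("job2/0", "SCHEDULE", 400),
    ("job2/0", "FINISH", 460),
]


def directly_follows(events):
    """Return {(a, b): [durations]} for consecutive events within each case."""
    by_case = defaultdict(list)
    for case_id, activity, ts in events:
        by_case[case_id].append((ts, activity))

    transitions = defaultdict(list)
    for trace in by_case.values():
        trace.sort()  # order each case's events by timestamp
        for (t0, a), (t1, b) in zip(trace, trace[1:]):
            transitions[(a, b)].append(t1 - t0)
    return transitions


def mean_waits(transitions):
    """Average the observed durations for every transition."""
    return {pair: sum(d) / len(d) for pair, d in transitions.items()}


waits = mean_waits(directly_follows(events))

# List transitions from slowest to fastest; the top entries are bottleneck candidates.
for (a, b), w in sorted(waits.items(), key=lambda kv: -kv[1]):
    print(f"{a} -> {b}: mean {w:.1f}s")
```

On this toy log, the slowest transition is the one from EVICT back to SCHEDULE, which is the kind of signal that would prompt a closer look at rescheduling delays. On the real data, the case notion and the activity labels would have to be chosen from the trace's own schema.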

Arik Senderovich, PhD, mindzie
