Process Mining Google Borg Trace Data

Â

An interesting use case where we are leveraging the mindzie studio to process mine Google Borg trace data to identify bottlenecks and improve capacity allocation and scheduling decisions. The data provides information about 8 different Borg cells. It includes CPU usage, information about job to resource allocations, and job-parent information for MapReduce jobs. It is fully anonymized and does not contain any user information. The data can be obtained here:

https://github.com/google/cluster-data/blob/master/ClusterData2019.md