clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
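Dear Alibaba researchers: Hello. First of all, thank you for publishing this dataset, which lets the academic community understand data center networks quantitatively. While processing the alibaba trace 2018 data, I found that the net_in and net_out values increase monotonically, so I assume the raw data are cumulative traffic counters collected with a tool similar to python psutil. After normalization, the normalized instantaneous traffic = (next value - previous value) / timestamp difference. Could you tell me what value was used for normalization, or roughly what power of 10 it is? Thanks. My guess is that it is the maximum of net_in and net_out before normalization, which should be a very large number. Without this value, the net_in and net_out data are hard to use. Sincerely, thank you very much. Allen...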
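The differencing described in the question above can be sketched as follows. This is a minimal illustration, assuming net_in (or net_out) samples are cumulative normalized counters and timestamps are in seconds; the function name `counter_to_rate` and the sample values are hypothetical, not taken from the trace. The result stays in normalized units per second, since the normalization constant is unknown.

```python
from typing import List, Sequence, Tuple


def counter_to_rate(
    timestamps: Sequence[float], values: Sequence[float]
) -> List[Tuple[float, float]]:
    """Convert a cumulative counter series into per-interval rates.

    Returns (timestamp, rate) pairs, where rate is
    (next value - previous value) / timestamp difference.
    Samples must already be sorted by timestamp.
    """
    if len(timestamps) != len(values):
        raise ValueError("timestamps and values must have the same length")

    rates = []
    for i in range(1, len(values)):
        dt = timestamps[i] - timestamps[i - 1]
        if dt <= 0:
            # Skip duplicate or out-of-order timestamps instead of dividing by zero.
            continue
        rates.append((timestamps[i], (values[i] - values[i - 1]) / dt))
    return rates


# Hypothetical samples: timestamps in seconds, normalized cumulative net_in.
sample_ts = [0, 10, 20, 20, 40]
sample_net_in = [0.0, 0.5, 1.5, 1.5, 2.5]
sample_rates = counter_to_rate(sample_ts, sample_net_in)
```

Here the duplicated timestamp at 20 seconds is skipped, so `sample_rates` holds three intervals. Recovering absolute bytes per second would still require the normalization constant the question asks about.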
Hi! I am trying to understand "timestamp" in the microservice data. The paper mentioned that these traces were collected from a 7-day period. However, the timestamp is explained in the...
Hi, Thanks a lot for collecting all the prior traces and making them publicly available to experiment with! I am wondering whether you have more information about when the AMTrace 2022...
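According to the text, ` MS_CallGraph_Table: MS Call Graphs information. Due to the large-scale data size, we sample the call graph based on the rate of 0.5%. It contains about more than twenty...
Currently, the time unit used in machine_usage is seconds, but the sampling interval varies and the amount of data differs greatly from day to day. Does the sampling interval have any special meaning, or what standard was used for collection? Thank you very much for publishing these traces.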
Hello, I have been trying to download the 2018 traces, but I get the following error when using the overseas links:
How can the timestamps be mapped to the actual time at which the data was collected?
For the Microservices 2021 dataset, the `MS_CallGraph_Table` has 5 different types in the `rpctype` column: - `rpc`: rpc caller - `http`: http caller - `db`: database calls...
Hi. Thank you for sharing the cluster data. They are a great help!!! I've been looking through the 2018 traces, and intend to use them for simulation. However, I've come...
I am wondering if there is any way to get the following data (I did not find these readily available in the 2021 traces): * All the utilization metrics (CPU...