clusterdata issues

Question about State machine of batch task and instance

3

As what is given in trace_201708.md, we found that both task and instance all have status of "Waiting". and what is declared is: task -> Waiting: A task in not...

LernaeanHydra

Question about batch jobs input

Hello, In MR, Spark, we are assuming each mapper or reducer handles portion of data. The data size for each map or reduce instance is at most equal to hdfs...

maniaabdi

Memory bandwidth usage value (mem_gps)

5

Glad to see the new trace includes memory bandwidth usage information. I've checked several machine_usage entries and found non-empty values. I'm somehow confused with its description "Normalized to maximum memory...

tomxice

Question about time unit

1

Currently, the time unit used in the traces is in seconds. Could you please provide the traces in finer time unit, e.g., milliseconds? It will help a lot when using...

minhqnguyen

About memory usage of machine_usage.csv

4

After anlalyzing the machine_usage.csv, I found that about 50% of machine memory are used neither by instance nor container, for example: machine_id = 'm_2824' , time_stamp = 461830, all instance...

odingzx

machines

2

Does "Machines" and "Server" mean physical machine here?

liwen-tj

Adding Server Capacity and Attributes

3

I am wondering if machine attributes such as resource capacity and information (num of cores, num of disks, num of CPUs, kernel version, clock speed, eth_speed, architecuture etc. ) can...

yangrenyu

A job contains multiple tasks

1

I don't quite understand " A job contains multiple tasks". Can anyone give an example about what is a job and what is a task? Thanks in advance.

liwen-tj

Resource usage for task can be higher than resources requested

3

There are many tasks in the dataset that utilize more resources than what was requested. For instance, job_id:10771 task_id:66551 has plan_cpu:0.75 [1] from the following entry in _batch_task.csv_: > 6301,6352,10771,66551,137,Terminated,**75**,0.01600704061294748...

jcarreira

Adding description for failures: task and instances

Would it be possible to add a field broadly classifying the root cause of the different failures? Both task failures and instance failures.

Theophilusbenson

clusterdata
clusterdata copied to clipboard

Metadata

Question about State machine of batch task and instance

Question about batch jobs input

Memory bandwidth usage value (mem_gps)

Question about time unit

About memory usage of machine_usage.csv

machines

Adding Server Capacity and Attributes

A job contains multiple tasks

Resource usage for task can be higher than resources requested

Adding description for failures: task and instances

← Metadata

Owner

Metadata

clusterdata clusterdata copied to clipboard

Metadata

← Metadata

Owner

Metadata

clusterdata
clusterdata copied to clipboard