Resources-Allocation-in-The-Edge-Computing-Environment-Using-Reinforcement-Learning icon indicating copy to clipboard operation
Resources-Allocation-in-The-Edge-Computing-Environment-Using-Reinforcement-Learning copied to clipboard

some question about parameter "offloading target"

Open uzmaru opened this issue 4 years ago • 2 comments

Hi,david.Thank you for sharing code.But I have a question about parameter "offloading target". In Reinforcement learning, offloading target exists in both state and action.It's so confusing.May you explain it detailedly?Thanks.

uzmaru avatar Jul 14 '21 08:07 uzmaru

Yes. I have same confuse too. And more. the offloading target is discrete variable, but DDPG only use for continuous action. Or you set the offloading target based on computing resource and bandwidth allocation? Example Bandwidth = 0 => offload = 0 (no offload). As I saw in source code, not this way to explain. I really confuse because, if this DDPG use for discrete, it's still a DDPG?

HuongDM1896 avatar Mar 21 '23 07:03 HuongDM1896

I have the answer to my question, It is softmax, you used the softmax to define the probability of offloading the target. It is a good solution to make the value of action continues.

HuongDM1896 avatar Jun 20 '23 08:06 HuongDM1896