mars
mars copied to clipboard
[BUG] Mars storage put_data_info hang
Describe the bug A clear and concise description of what the bug is.
To Reproduce To help us reproducing this bug, please provide information below:
-
Your Python version: 3.7.9
-
The version of Mars you use: master
-
Versions of crucial packages, such as numpy, scipy and pandas:
-
Full stack of the error. 2022-04-02 14:04:19,835 WARNING debug.py:81 -- Message that client sent to actor ray://mars_cluster_1648879130/4/0 is SendMessage(actor_ref=ActorRef(uid=b'DataManagerActor', address='ray://mars_cluster_1648879130/4/0'), content=('put_data_info', 1, ([('QHwF6b4ZKIC4raH4Ay5SJCLl', ('61a5855f089c0d2df5615a4be097b29b_0', (0, 0)), DataInfo(object_id=ObjectRef(d36969e3bab0432009e38f4a072d00192f08d8a1e700008001000000), level=<StorageLevel.MEMORY: 2>, memory_size=303001, store_size=303001, band='numa-0'), ObjectInfo(size=None, device=None, object_id=ObjectRef(d36969e3bab0432009e38f4a072d00192f08d8a1e700008001000000))), ('QHwF6b4ZKIC4raH4Ay5SJCLl', ('61a5855f089c0d2df5615a4be097b29b_0', (1, 0)), DataInfo(object_id=ObjectRef(d36969e3bab0432009e38f4a072d00192f08d8a1e700008002000000), level=<StorageLevel.MEMORY: 2>, memory_size=152734, store_size=152734, band='numa-0'), ObjectInfo(size=None, device=None, object_id=ObjectRef(d36969e3bab0432009e38f4a072d00192f08d8a1e700008002000000))), ('QHwF6b4ZKIC4raH4Ay5SJCLl', ('61a5855f089c0d2df5615a4be097b29b_0', (2, 0)),

-
Minimized code to reproduce the error.
Expected behavior
Call put_data_info is a local call in same node, it shouldn't hang and timeout. Also there ara plenty of info duplicate in the aruguments and it should be avoided.
Additional context Add any other context about the problem here.