Question about memory consolidation methods

Open • AbbeGo opened this issue 1 year ago • 4 comments

The attached figure shows the results of an ablation experiment from your XMem paper. How did you conduct the comparison with the Random method, and do you have any hypotheses as to why Random performs better than K-means? Additionally, the Random method does not perform significantly better than the Usage-based method. Looking forward to your response.

[image: ablation result figure]

AbbeGo • Nov 06 '24 03:11

Hello, thank you for the question!

We do find random selection to be surprisingly effective. One hypothesis is that the more "used" memory locations are also over-represented (e.g., as large objects or background regions), so a random selection is likely to pick them as well. K-means is known to struggle in higher dimensions. Combined with the intuition above, it might under-represent those large clusters by assigning only one centroid to each and selecting more isolated points instead.
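The intuition above can be probed with a toy sketch. This is not the experiment from the paper: the data is synthetic and deliberately constructed to match the hypothesis (one dense, over-represented cluster plus a few widely scattered points), all function names are made up for illustration, and the k-means here is a plain NumPy Lloyd's iteration with k-means++ seeding rather than anything taken from the XMem code. It only compares how many of the selected prototypes land in the large cluster under each strategy.

```python
import numpy as np


def make_toy_memory(rng, n_big=960, n_isolated=40, dim=64):
    """Assumed toy data: one dense, over-represented cluster (think
    background) plus a few widely scattered, isolated points."""
    big = rng.normal(0.0, 0.05, size=(n_big, dim))
    isolated = rng.normal(0.0, 5.0, size=(n_isolated, dim))
    feats = np.concatenate([big, isolated], axis=0)
    is_big = np.arange(n_big + n_isolated) < n_big
    return feats, is_big


def sq_dists(a, b):
    """Pairwise squared Euclidean distances between rows of a and rows of b."""
    d = (a ** 2).sum(1)[:, None] - 2.0 * a @ b.T + (b ** 2).sum(1)[None, :]
    return np.maximum(d, 0.0)  # clamp tiny negatives from round-off


def kmeans_prototypes(feats, k, rng, n_iter=20):
    """Lloyd's k-means with k-means++ seeding. Returns, for each centroid,
    the index of the nearest real feature vector."""
    # k-means++ seeding: draw each new centre with probability proportional
    # to its squared distance from the nearest centre chosen so far.
    centres = [feats[rng.integers(len(feats))]]
    for _ in range(k - 1):
        d2 = sq_dists(feats, np.stack(centres)).min(axis=1)
        centres.append(feats[rng.choice(len(feats), p=d2 / d2.sum())])
    centres = np.stack(centres)

    for _ in range(n_iter):
        assign = sq_dists(feats, centres).argmin(axis=1)
        for j in range(k):
            members = feats[assign == j]
            if len(members) > 0:  # keep the old centre if a cluster empties
                centres[j] = members.mean(axis=0)

    return sq_dists(centres, feats).argmin(axis=1)


def random_prototypes(feats, k, rng):
    """Uniform random selection of k distinct memory entries."""
    return rng.choice(len(feats), size=k, replace=False)


rng = np.random.default_rng(0)
feats, is_big = make_toy_memory(rng)
k = 8

# Share of selected prototypes that fall in the large cluster.
random_share = is_big[random_prototypes(feats, k, rng)].mean()
kmeans_share = is_big[kmeans_prototypes(feats, k, rng)].mean()
print(f"random : {random_share:.2f} of prototypes in the large cluster")
print(f"k-means: {kmeans_share:.2f} of prototypes in the large cluster")
```

On data shaped like this, random selection tends to follow the population, so most of its picks come from the large cluster, while k-means tends to give that cluster a single centroid and use the rest on isolated points. Real memory features are not this clean, so the size of the gap here says nothing about the numbers in the paper.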

hkchengrex • Nov 08 '24 21:11

Thank you for your response, which has given me a new understanding of the memory mechanism.

AbbeGo • Nov 09 '24 15:11

I would like to ask one more question. In your comparative experiments on different long-term memory mechanisms, were the experiments conducted using only long-term memory, or were the results in the table obtained with sensory memory and working memory also enabled? I am particularly interested in whether you have ever used only long-term memory for video tracking, and how the results compared. I look forward to your response.

AbbeGo • Dec 17 '24 03:12

The other types of memory remain in use. Using only long-term memory would probably degrade performance too much.

hkchengrex • Dec 17 '24 05:12