
printlock.lock() mutex assertion when using USE_TRACEPOINT performance monitoring

Open dbickson opened this issue 11 years ago • 1 comments

I am getting the following error:

    dc_call_dispatch: dc: time spent issuing RPC calls
    Events: 1262
    Total: 671.31 ms
    Mean: 0.531941 ms
    Min: 0.0446786 ms
    Max: 0.797284 ms
    dc_receive_multiplexing: dc: time spent exploding a chunk
    Events: 0
    Total: 0 ms
    [Thread 0x7fffefe9f700 (LWP 10122) exited]
    [Thread 0x7fffeee9d700 (LWP 10124) exited]
    [Thread 0x7fffee69c700 (LWP 10125) exited]
    [Thread 0x7fffef69e700 (LWP 10123) exited]
    ERROR: mutex.hpp(lock:69): Check failed: !error

    Program received signal SIGABRT, Aborted.
    0x00007ffff5965425 in raise () from /lib/x86_64-linux-gnu/libc.so.6
    (gdb) where
    #0 0x00007ffff5965425 in raise () from /lib/x86_64-linux-gnu/libc.so.6
    #1 0x00007ffff5968b8b in abort () from /lib/x86_64-linux-gnu/libc.so.6
    #2 0x000000000076b7bf in graphlab::mutex::lock (this=0xd22f00) at /home/ubuntu/graphlab/src/graphlab/parallel/mutex.hpp:69
    #3 0x00000000009043ad in graphlab::trace_count::~trace_count (this=0xd205a0, __in_chrg=) at /home/ubuntu/graphlab/src/graphlab/util/tracepoint.cpp:65
    #4 0x00007ffff596a901 in ?? () from /lib/x86_64-linux-gnu/libc.so.6
    #5 0x00007ffff596a985 in exit () from /lib/x86_64-linux-gnu/libc.so.6
    #6 0x00007ffff5950774 in __libc_start_main () from /lib/x86_64-linux-gnu/libc.so.6
    #7 0x000000000075b4c9 in _start ()

    (gdb) f 3
    #3 0x00000000009043ad in graphlab::trace_count::~trace_count (this=0xd205a0, __in_chrg=) at /home/ubuntu/graphlab/src/graphlab/util/tracepoint.cpp:65
    65 printlock.lock();

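It seems that printlock is either already locked or the mutex is in an error state for some other reason. Note that the failing lock() is called from the trace_count destructor during exit() (frames #3 to #5), after the worker threads have already exited.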
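One possible explanation, not confirmed anywhere in this thread: if printlock is itself a static object, it may already have been destroyed by the time the static trace_count destructor runs at exit, since destruction order across translation units is not under the programmer's control. Locking a destroyed pthread mutex can then return an error. The sketch below shows the usual construct-on-first-use workaround, assuming that diagnosis is right. The names `print_lock` and `trace_count_sketch` are hypothetical stand-ins, and `std::mutex` is used in place of GraphLab's own mutex class.

```cpp
#include <iostream>
#include <mutex>
#include <string>

// Hypothetical replacement for a static `printlock` object.
// The mutex is heap-allocated on first use and intentionally never deleted,
// so it is still valid while other static destructors run during exit().
std::mutex& print_lock() {
  static std::mutex* m = new std::mutex;
  return *m;
}

// Hypothetical stand-in for graphlab::trace_count: it prints a summary
// from its destructor, which may run during static destruction.
struct trace_count_sketch {
  std::string name;
  unsigned long events;

  trace_count_sketch(const std::string& n, unsigned long e)
      : name(n), events(e) {}

  // Builds the summary line without touching any shared state.
  std::string summary() const {
    return name + ": Events: " + std::to_string(events);
  }

  ~trace_count_sketch() {
    // Safe even at exit: the mutex returned by print_lock() is never destroyed.
    std::lock_guard<std::mutex> guard(print_lock());
    std::cout << summary() << "\n";
  }
};
```

Declaring a `trace_count_sketch` at namespace scope and letting the program exit normally exercises the same path as frames #3 to #5 of the backtrace. The trade-off is a deliberate one-time leak of the mutex, which is generally harmless at process exit.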

To reproduce this error, enable USE_TRACEPOINT and run ./svd smallnetflix/ --rows=95527 --cols=3562 --nv=6 --nsv=2 --max_iter=2 where the folder smallnetflix contains the smallnetflix_mm.train input file.

Thanks!

dbickson, Oct 25 '13 18:10

Additional information: when I print the error using perror, I get "No such file or directory". When I comment out the printlock.lock()/unlock() lines, everything works fine.
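A note on the perror output: POSIX thread functions such as pthread_mutex_lock return the error number directly and do not set errno, so perror (which reads errno) likely reports a stale value from an earlier, unrelated call. Assuming graphlab::mutex::lock wraps pthread_mutex_lock (the "Check failed: !error" message suggests this, but the source is not shown here), the returned code is the value worth printing. The sketch below illustrates the idea with an error-checking mutex; `describe_pthread_error` and `relock_error_code` are hypothetical helpers, not GraphLab functions.

```cpp
#include <pthread.h>
#include <cerrno>
#include <cstring>
#include <string>

// Hypothetical helper: turn a pthread return code into readable text.
// pthread functions return the error number; they do not set errno.
std::string describe_pthread_error(int rc) {
  return rc == 0 ? std::string("ok") : std::string(std::strerror(rc));
}

// Locks an error-checking mutex twice from the same thread and returns
// the code produced by the second lock attempt (EDEADLK per POSIX).
int relock_error_code() {
  pthread_mutexattr_t attr;
  pthread_mutexattr_init(&attr);
  pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ERRORCHECK);

  pthread_mutex_t m;
  pthread_mutex_init(&m, &attr);
  pthread_mutexattr_destroy(&attr);

  pthread_mutex_lock(&m);
  int rc = pthread_mutex_lock(&m);  // fails: this thread already owns m

  pthread_mutex_unlock(&m);
  pthread_mutex_destroy(&m);
  return rc;
}
```

Printing `describe_pthread_error(error)` at the point where mutex.hpp checks `!error` would show the real cause, for example EINVAL for an invalid or destroyed mutex or EDEADLK for a relock, instead of the unrelated "No such file or directory".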

dbickson, Oct 25 '13 19:10