AliceO2 icon indicating copy to clipboard operation
AliceO2 copied to clipboard

DPL: allow resetting the oldest possible timeframe mechanism

Open ktf opened this issue 2 years ago • 21 comments

ktf avatar Sep 21 '22 22:09 ktf

Error while checking build/O2/fullCI for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-10-02 20:45:

## sw/BUILD/o2checkcode-latest/log
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/Common/test/testGPUsortCUDA.cu:22:10: error: 'boost/test/unit_test.hpp' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUTracking/TRDTracking/GPUTRDTracker.cxx:37:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUTracking/Base/GPUReconstruction.cxx:37:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUTracking/Base/GPUReconstructionCPU.cxx:45:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUTracking/display/GPUDisplay.cxx:36:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUTracking/Base/cuda/GPUReconstructionCUDAGenRTC.cu:16:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/cuda/../Shared/Utils.h:26:10: error: 'boost/program_options.hpp' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/cuda/../Shared/Utils.h:26:10: error: 'boost/program_options.hpp' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Logger/include/Framework/Logger.h:14:10: error: 'fairlogger/Logger.h' file not found [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:520:12: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1059:69: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1175:5: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1852:16: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:2276:18: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:4336:16: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/sm_20_atomic_functions.h:89:39: error: redefinition of 'atomicAdd' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/../Shared/Utils.h:146:34: error: use of undeclared identifier 'int4'; did you mean 'int'? [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/benchmark.hip.cxx:199:35: error: use of undeclared identifier 'int4'; did you mean 'int'? [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/../Shared/Utils.h:146:34: error: use of undeclared identifier 'int4'; did you mean 'int'? [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:64:8: error: unknown type name '__host__' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:64:27: error: expected ';' after top level declarator [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:64:33: error: overloaded 'operator+=' must have at least one parameter of class or enumeration type [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:64:44: error: unknown type name 'int4'; did you mean 'int'? [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:64:53: error: unknown type name 'int4'; did you mean 'int'? [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:66:4: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:66:11: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:67:4: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:67:11: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:68:4: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:68:11: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:69:4: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:69:11: error: member reference base type 'int' is not a structure or union [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:85:1: error: unknown type name '__global__' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:90:19: error: use of undeclared identifier 'blockIdx' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:90:32: error: use of undeclared identifier 'blockDim' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:90:45: error: use of undeclared identifier 'threadIdx' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/GPU/GPUbenchmark/hip/Kernels.hip.cxx:90:78: error: use of undeclared identifier 'blockDim' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Logger/include/Framework/Logger.h:14:10: error: 'fairlogger/Logger.h' file not found [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:520:12: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1059:69: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1175:5: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:1852:16: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:2276:18: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/cuda/std/detail/libcxx/include/type_traits:4336:16: error: CUDA device code does not support variadic functions [clang-diagnostic-error]
/usr/local/cuda-11.7/include/sm_20_atomic_functions.h:89:39: error: redefinition of 'atomicAdd' [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Detectors/EMCAL/calibration/include/EMCALCalibration/EMCALCalibExtractor.h:36:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Detectors/EMCAL/calibration/include/EMCALCalibration/EMCALCalibExtractor.h:36:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Detectors/EMCAL/calibration/include/EMCALCalibration/EMCALCalibExtractor.h:36:10: error: 'omp.h' file not found [clang-diagnostic-error]
/sw/SOURCES/O2/9895-slc8_x86-64/0/Detectors/TOF/calibration/src/TOFChannelCalibrator.cxx:23:10: error: 'omp.h' file not found [clang-diagnostic-error]

Full log here.

alibuild avatar Sep 22 '22 07:09 alibuild

Error while checking build/O2/o2-dataflow for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-10-05 22:46:

## sw/BUILD/O2-latest/log
100% tests passed, 0 tests failed out of 434
100% tests passed, 0 tests failed out of 102


## sw/BUILD/QualityControl-latest/log
20/38 Test #25: testCheckWorkflow .......................***Failed    7.36 sec
[2821:qc-task-TST-XYZTask]: 2022-10-05 20:44:37.265651 !!! [1;31mError - Could not find the DPL InfoLogger[0m
[2822:qc-task-TST-newName]: 2022-10-05 20:44:37.266677 !!! [1;31mError - Could not find the DPL InfoLogger[0m
[2823:qc-check-sink-QTST_QYZTask_0]: 2022-10-05 20:44:37.267191 !!! [1;31mError - Could not find the DPL InfoLogger.[0m
[2824:qc-check-sink-QTST_XYZTask_0]: 2022-10-05 20:44:37.268394 !!! [1;31mError - Could not find the DPL InfoLogger.[0m
[2825:qc-check-TST-u2F0]: 2022-10-05 20:44:37.270424 !!! [1;31mError - Could not find the DPL InfoLogger.[0m
[2826:qc-check-TST-CheckSeparately]: 2022-10-05 20:44:37.271006 !!! [1;31mError - Could not find the DPL InfoLogger.[0m
[2820:qc-task-TST-QYZTask]: 2022-10-05 20:44:37.275664 !!! [1;31mError - Could not find the DPL InfoLogger[0m
[2827:qc-check-TST-XYZCheck]: 2022-10-05 20:44:37.274220 !!! [1;31mError - Could not find the DPL InfoLogger.[0m
[ERROR] pid 2828 (Receiver) crashed with 1
[ERROR] SEVERE: Device Receiver (2828) returned with 1
97% tests passed, 1 tests failed out of 38

Full log here.

alibuild avatar Sep 22 '22 23:09 alibuild

Error while checking build/O2/o2-dataflow-cs8 for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-10-05 04:17:

No log files found

Full log here.

alibuild avatar Sep 24 '22 01:09 alibuild

Error while checking build/O2/o2 for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-10-01 05:47:

## sw/BUILD/O2-latest/log
100% tests passed, 0 tests failed out of 457
100% tests passed, 0 tests failed out of 102


## sw/BUILD/QualityControl-latest/log
100% tests passed, 0 tests failed out of 38


## sw/BUILD/O2Physics-latest/log
/sw/slc7_x86-64/O2/9895-slc7_x86-64-local2/include/SimulationDataFormat/MCTrack.h:24:10: fatal error: TMCProcess.h: No such file or directory
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Sep 24 '22 09:09 alibuild

@ktf : What happens actually at stop/start. Will the timeSlice start again at 0? Then it must also be reset at several other places?

davidrohr avatar Sep 26 '22 13:09 davidrohr

@davidrohr for the O2 tasks (readout/DD/reco/QC/calib) a stop/start reusing an existing environment should be indistinguishable from a fresh start using a new environment.

vascobarroso avatar Sep 26 '22 14:09 vascobarroso

well, depends. If we want to use that stop/start as PAR perhaps not?

davidrohr avatar Sep 26 '22 14:09 davidrohr

Indeed. But I think the current stop/start being developed is closer to the fast SOR/EOR we had during Run 2. PAR will come later.

vascobarroso avatar Sep 26 '22 14:09 vascobarroso

Error while checking build/AliceO2/O2/o2/macOS-arm for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-09-28 19:28:

No log files found

Full log here.

alibuild avatar Sep 26 '22 23:09 alibuild

Error while checking build/AliceO2/O2/o2/macOS for c766e68cbcdcb8340160b095070e9a538ac9788e at 2022-09-28 22:15:

No log files found

Full log here.

alibuild avatar Sep 28 '22 20:09 alibuild

@ktf @davidrohr any reason why this has not yet been merged ?

vascobarroso avatar Oct 06 '22 07:10 vascobarroso

@ktf @davidrohr @shahor02 any chance this will be merged this week ? There is a scheduled START/STOP/START test at P2 on Tuesday next week, but if this will not be there I think it would be better to postpone.

vascobarroso avatar Oct 10 '22 14:10 vascobarroso

Sorry this week was complicated. I just fixed the conflict and I am trying this again.

ktf avatar Oct 14 '22 07:10 ktf

@ktf : shall we merge this now?

davidrohr avatar Oct 14 '22 14:10 davidrohr

Error while checking build/O2/fullCI for 2ab337583a56719868dca0ae174b046b52fee437 at 2023-04-07 09:46:

## sw/BUILD/O2-latest/log
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:506:35: error: no matching function for call to 'o2::framework::ServiceRegistry::get<o2::framework::TimesliceIndex>()'
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:504:13: error: invalid user-defined conversion from 'o2::framework::CommonServices::decongestionSpec()::<lambda(o2::framework::ServiceRegistry&, void*)>' to 'o2::framework::ServiceStopCallback' {aka 'void (*)(o2::framework::ServiceRegistryRef, void*)'} [-fpermissive]
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 15 '22 15:10 alibuild

Error while checking build/O2/o2-cs8 for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-21 08:30:

## sw/BUILD/O2-latest/log
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:464:36: error: no matching function for call to 'o2::framework::ServiceRegistry::get<o2::framework::TimesliceIndex>()'
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:517:32: error: invalid user-defined conversion from 'o2::framework::CommonServices::decongestionSpec()::<lambda(o2::framework::ServiceRegistry&, void*)>' to 'o2::framework::ServiceStopCallback' {aka 'void (*)(o2::framework::ServiceRegistryRef, void*)'} [-fpermissive]
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 15 '22 21:10 alibuild

Error while checking build/O2/o2 for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-25 01:02:

## sw/BUILD/O2-latest/log
/sw/SOURCES/O2/9895-slc7_x86-64/0/Framework/Core/src/CommonServices.cxx:466:36: error: no matching function for call to 'o2::framework::ServiceRegistry::get<o2::framework::TimesliceIndex>()'
/sw/SOURCES/O2/9895-slc7_x86-64/0/Framework/Core/src/CommonServices.cxx:519:32: error: invalid user-defined conversion from 'o2::framework::CommonServices::decongestionSpec()::<lambda(o2::framework::ServiceRegistry&, void*)>' to 'o2::framework::ServiceStopCallback' {aka 'void (*)(o2::framework::ServiceRegistryRef, void*)'} [-fpermissive]
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 17 '22 00:10 alibuild

Error while checking build/O2/o2-dataflow-cs8 for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-25 11:38:

## sw/BUILD/O2-latest/log
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:466:36: error: no matching function for call to 'o2::framework::ServiceRegistry::get<o2::framework::TimesliceIndex>()'
/sw/SOURCES/O2/9895-slc8_x86-64/0/Framework/Core/src/CommonServices.cxx:519:32: error: invalid user-defined conversion from 'o2::framework::CommonServices::decongestionSpec()::<lambda(o2::framework::ServiceRegistry&, void*)>' to 'o2::framework::ServiceStopCallback' {aka 'void (*)(o2::framework::ServiceRegistryRef, void*)'} [-fpermissive]
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 20 '22 21:10 alibuild

Error while checking build/O2/o2-dataflow for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-25 00:35:

## sw/BUILD/O2-latest/log
/sw/SOURCES/O2/9895-slc7_x86-64/0/Framework/Core/src/CommonServices.cxx:466:36: error: no matching function for call to 'o2::framework::ServiceRegistry::get<o2::framework::TimesliceIndex>()'
/sw/SOURCES/O2/9895-slc7_x86-64/0/Framework/Core/src/CommonServices.cxx:519:32: error: invalid user-defined conversion from 'o2::framework::CommonServices::decongestionSpec()::<lambda(o2::framework::ServiceRegistry&, void*)>' to 'o2::framework::ServiceStopCallback' {aka 'void (*)(o2::framework::ServiceRegistryRef, void*)'} [-fpermissive]
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 20 '22 22:10 alibuild

Error while checking build/AliceO2/O2/o2/macOS-arm for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-21 20:33:

## sw/BUILD/O2-latest/log
/System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:464:16: error: no matching member function for call to 'get'
/System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:462:13: error: no viable conversion from '(lambda at /System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:462:13)' to 'o2::framework::ServiceStopCallback' (aka 'void (*)(o2::framework::ServiceRegistryRef, void *)')
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 21 '22 18:10 alibuild

Error while checking build/AliceO2/O2/o2/macOS for 2ab337583a56719868dca0ae174b046b52fee437 at 2022-10-21 22:13:

## sw/BUILD/O2-latest/log
/System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:464:16: error: no matching member function for call to 'get'
/System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:462:13: error: no viable conversion from '(lambda at /System/Volumes/Data/build/alice-ci-workdir/o2/sw/SOURCES/O2/9895/0/Framework/Core/src/CommonServices.cxx:462:13)' to 'o2::framework::ServiceStopCallback' (aka 'void (*)(o2::framework::ServiceRegistryRef, void *)')
ninja: build stopped: subcommand failed.

Full log here.

alibuild avatar Oct 21 '22 20:10 alibuild

This PR did not have any update in the last 30 days. Is it still needed? Unless further action in will be closed in 5 days.

github-actions[bot] avatar Jan 08 '23 01:01 github-actions[bot]

This PR did not have any update in the last 30 days. Is it still needed? Unless further action in will be closed in 5 days.

github-actions[bot] avatar Feb 11 '23 01:02 github-actions[bot]

This PR did not have any update in the last 30 days. Is it still needed? Unless further action in will be closed in 5 days.

github-actions[bot] avatar Apr 01 '23 01:04 github-actions[bot]

this fixes issues I see on the EPN, tested locally, only fixed clang format, merging

davidrohr avatar Nov 15 '23 13:11 davidrohr