PatientLevelPrediction icon indicating copy to clipboard operation
PatientLevelPrediction copied to clipboard

Error in getPLP when using Redshift

Open rfherrera opened this issue 2 years ago • 1 comments

Describe the bug When running getPLP, I am getting the following error java.sql.SQLException: Amazon Invalid operation: Value out of range for 4 bytes. Details: ----------------------------------------------- error: Value out of range for 4 bytes. code: 8001 context: Input:21605144410. query: 26832412 location: numeric_bound.cpp:61 process: query1_302_26832412 [pid=11450]

Set up (please run in R "sessionInfo()" and copy the output here): R version 4.0.3 (2020-10-10) Platform: x86_64-pc-linux-gnu (64-bit) Running under: Ubuntu 20.04 LTS

Matrix products: default BLAS/LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.8.so

locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8
[4] LC_COLLATE=en_US.UTF-8 LC_MONETARY=en_US.UTF-8 LC_MESSAGES=C
[7] LC_PAPER=en_US.UTF-8 LC_NAME=en_US.UTF-8 LC_ADDRESS=en_US.UTF-8
[10] LC_TELEPHONE=en_US.UTF-8 LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8

attached base packages: [1] stats graphics grDevices utils datasets methods base

other attached packages: [1] PheValuator_2.1.5 FeatureExtraction_3.2.0 Andromeda_0.6.0
[4] dplyr_1.0.9 DatabaseConnector_5.0.2

loaded via a namespace (and not attached): [1] Rcpp_1.0.8.3 pillar_1.7.0 compiler_4.0.3
[4] tools_4.0.3 bit_4.0.4 jsonlite_1.8.0
[7] lattice_0.20-41 RSQLite_2.2.12 memoise_2.0.1
[10] lifecycle_1.0.1 tibble_3.1.6 pkgconfig_2.0.3
[13] rlang_1.0.2 Matrix_1.3-2 rstudioapi_0.13
[16] DBI_1.1.2 cli_3.3.0 PatientLevelPrediction_5.0.5 [19] fastmap_1.1.0 rJava_1.0-6 generics_0.1.2
[22] vctrs_0.4.1 hms_1.1.1 grid_4.0.3
[25] triebeard_0.3.0 bit64_4.0.5 xgboost_1.2.0.1
[28] tidyselect_1.1.2 data.table_1.14.2 glue_1.6.2
[31] R6_2.5.1 fansi_1.0.3 purrr_0.3.4
[34] ParallelLogger_2.0.2 SqlRender_1.9.0 blob_1.2.3
[37] magrittr_2.0.3 urltools_1.7.3 ellipsis_0.3.2
[40] assertthat_0.2.1 utf8_1.2.2 stringi_1.7.6
[43] cachem_1.0.6 crayon_1.5.1

rfherrera avatar May 02 '22 03:05 rfherrera

Hi Ron,

That error seems to suggest for some reason there is a large number somewhere (either in the database or being created). What features are you extracting? My guess is that this error is most likely coming from FeatureExtraction. There is code in getPlp for sampling that is casting to BigInt that is the only place I can see in PatientLevelPrediction potentially being responsible, are you sampling any patients?

Best wishes, Jenna

jreps avatar May 02 '22 12:05 jreps