s4 icon indicating copy to clipboard operation
s4 copied to clipboard

segmentation fault when running python -m train pipeline=mnist model=s4

Open liluo2 opened this issue 11 months ago • 0 comments

When I was running with the command python -X faulthandler -m train pipeline=mnist model=s4, I found that there's a segmentation fault. I located the problematic code with:

File "/home/liluo/code/s4/src/models/hippo/hippo.py", line 78 in transition                                            
File "/home/liluo/code/s4/src/models/hippo/hippo.py", line 220 in nplr                                                 
File "/home/liluo/code/s4/src/models/sequence/kernels/dplr.py", line 157 in ssm                                        
File "/home/liluo/code/s4/src/models/sequence/kernels/dplr.py", line 176 in <listcomp>                                 
File "/home/liluo/code/s4/src/models/sequence/kernels/dplr.py", line 176 in combination
  File "/home/liluo/code/s4/src/models/sequence/kernels/ssm.py", line 173 in init_ssm_dplr
  File "/home/liluo/code/s4/src/models/sequence/kernels/ssm.py", line 479 in __init__
  File "/home/liluo/code/s4/src/models/sequence/kernels/fftconv.py", line 71 in __init__
  File "/home/liluo/code/s4/src/utils/config.py", line 101 in instantiate
  File "/home/liluo/code/s4/src/models/sequence/modules/s4block.py", line 109 in __init__
  File "/home/liluo/code/s4/src/utils/config.py", line 101 in instantiate
  File "/home/liluo/code/s4/src/models/sequence/backbones/block.py", line 48 in __init__
  File "/home/liluo/code/s4/src/models/sequence/backbones/model.py", line 84 in __init__
  File "/home/liluo/code/s4/src/utils/config.py", line 101 in instantiate
  File "/home/liluo/code/s4/train.py", line 171 in setup
  File "/home/liluo/code/s4/train.py", line 147 in __init__
  File "/home/liluo/code/s4/train.py", line 701 in train
  File "/home/liluo/code/s4/train.py", line 830 in main

It seems that A = T @ M @ np.linalg.inv(T) causes segment fault. I don't know how to solve this problem... I set ulimit -s 65536 but it's not working.

My environments:

#
# Name                    Version                   Build  Channel
_libgcc_mutex             0.1                        main    defaults
_openmp_mutex             5.1                       1_gnu    defaults
_sysroot_linux-64_curr_repodata_hack 3                   haa98f57_10    defaults
absl-py                   2.1.0                    pypi_0    pypi
aiohttp                   3.9.3                    pypi_0    pypi
aiosignal                 1.3.1                    pypi_0    pypi
annotated-types           0.6.0                    pypi_0    pypi
antlr4-python3-runtime    4.9.3                    pypi_0    pypi
appdirs                   1.4.4                    pypi_0    pypi
asttokens                 2.4.1                    pypi_0    pypi
async-timeout             4.0.3                    pypi_0    pypi
attrs                     23.2.0                   pypi_0    pypi
binutils_impl_linux-64    2.35.1               h27ae35d_9    defaults
binutils_linux-64         2.35.1              h454624a_30    defaults
black                     23.7.0                   pypi_0    pypi
blas                      1.0                         mkl    defaults
brotli                    1.0.9                h5eee18b_7    defaults
brotli-bin                1.0.9                h5eee18b_7    defaults
bzip2                     1.0.8                h5eee18b_5    defaults
c-ares                    1.19.1               h5eee18b_0    anaconda
ca-certificates           2023.12.12           h06a4308_0    defaults
certifi                   2024.2.2                 pypi_0    pypi
cfgv                      3.4.0                    pypi_0    pypi
charset-normalizer        3.3.2                    pypi_0    pypi
click                     8.1.7                    pypi_0    pypi
cmake                     3.28.3                   pypi_0    pypi
comm                      0.2.2                    pypi_0    pypi
contourpy                 1.2.0            py39hdb19cb5_0    defaults
coverage                  7.4.3                    pypi_0    pypi
cycler                    0.11.0             pyhd3eb1b0_0    defaults
cyrus-sasl                2.1.28               h52b45da_1    defaults
datasets                  2.18.0                   pypi_0    pypi
dbus                      1.13.18              hb2f20db_0    defaults
debugpy                   1.8.1                    pypi_0    pypi
decorator                 5.1.1                    pypi_0    pypi
dill                      0.3.8                    pypi_0    pypi
distlib                   0.3.8                    pypi_0    pypi
docformatter              1.7.5                    pypi_0    pypi
docker-pycreds            0.4.0                    pypi_0    pypi
einops                    0.7.0                    pypi_0    pypi
exceptiongroup            1.2.0                    pypi_0    pypi
executing                 2.0.1                    pypi_0    pypi
expat                     2.5.0                h6a678d5_0    defaults
filelock                  3.13.1                   pypi_0    pypi
flake8                    7.0.0                    pypi_0    pypi
flake8-bugbear            24.2.6                   pypi_0    pypi
flake8-comprehensions     3.14.0                   pypi_0    pypi
fontconfig                2.14.1               h4c34cd2_2    defaults
fonttools                 4.25.0             pyhd3eb1b0_0    defaults
freetype                  2.12.1               h4a9f257_0    defaults
frozenlist                1.4.1                    pypi_0    pypi
fsspec                    2024.2.0                 pypi_0    pypi
gcc_impl_linux-64         9.3.0               h6df7d76_17    defaults
gcc_linux-64              9.3.0               h1ee779e_30    defaults
gitdb                     4.0.11                   pypi_0    pypi
gitpython                 3.1.42                   pypi_0    pypi
glib                      2.78.4               h6a678d5_0    defaults
glib-tools                2.78.4               h6a678d5_0    defaults
gluonts                   0.14.4                   pypi_0    pypi
grpcio                    1.62.1                   pypi_0    pypi
gst-plugins-base          1.14.1               h6a678d5_1    defaults
gstreamer                 1.14.1               h5eee18b_1    defaults
gxx_impl_linux-64         9.3.0               hbdd7822_17    defaults
gxx_linux-64              9.3.0               h7e70986_30    defaults
h5py                      3.9.0            py39he06866b_0    anaconda
hdf5                      1.12.1               h2b7332f_3    anaconda
huggingface-hub           0.21.4                   pypi_0    pypi
hydra-core                1.3.2                    pypi_0    pypi
icu                       73.1                 h6a678d5_0    defaults
identify                  2.5.35                   pypi_0    pypi
idna                      3.6                      pypi_0    pypi
importlib-metadata        7.0.2                    pypi_0    pypi
importlib_resources       6.1.1            py39h06a4308_1    defaults
intel-openmp              2023.1.0         hdb19cb5_46306    defaults
ipython                   8.18.1                   pypi_0    pypi
ipywidgets                8.1.2                    pypi_0    pypi
isort                     5.13.2                   pypi_0    pypi
jedi                      0.19.1                   pypi_0    pypi
jinja2                    3.1.3                    pypi_0    pypi
joblib                    1.2.0            py39h06a4308_0    defaults
jpeg                      9e                   h5eee18b_1    defaults
jupyterlab-widgets        3.0.10                   pypi_0    pypi
keopscore                 2.2.2                    pypi_0    pypi
kernel-headers_linux-64   3.10.0              h57e8cba_10    defaults
kiwisolver                1.4.4            py39h6a678d5_0    defaults
krb5                      1.20.1               h143b758_1    defaults
lcms2                     2.12                 h3be6417_0    defaults
ld_impl_linux-64          2.35.1               h7274673_9    defaults
lerc                      3.0                  h295c915_0    defaults
libbrotlicommon           1.0.9                h5eee18b_7    defaults
libbrotlidec              1.0.9                h5eee18b_7    defaults
libbrotlienc              1.0.9                h5eee18b_7    defaults
libclang                  14.0.6          default_hc6dbbc7_1    defaults
libclang13                14.0.6          default_he11475f_1    defaults
libcups                   2.4.2                h2d74bed_1    defaults
libcurl                   7.88.1               h251f7ec_2    anaconda
libdeflate                1.17                 h5eee18b_1    defaults
libedit                   3.1.20230828         h5eee18b_0    defaults
libev                     4.33                 h7f8727e_1    anaconda
libffi                    3.4.4                h6a678d5_0    defaults
libgcc-devel_linux-64     9.3.0               hb95220a_17    defaults
libgcc-ng                 11.2.0               h1234567_1    defaults
libgfortran-ng            11.2.0               h00389a5_1    defaults
libgfortran5              11.2.0               h1234567_1    defaults
libglib                   2.78.4               hdc74915_0    defaults
libgomp                   11.2.0               h1234567_1    defaults
libiconv                  1.16                 h7f8727e_2    defaults
libllvm14                 14.0.6               hdb19cb5_3    defaults
libnghttp2                1.57.0               h2d74bed_0    anaconda
libpng                    1.6.39               h5eee18b_0    defaults
libpq                     12.17                hdbd6064_0    defaults
libssh2                   1.10.0               hdbd6064_2    anaconda
libstdcxx-devel_linux-64  9.3.0               hf0c5c8d_17    defaults
libstdcxx-ng              11.2.0               h1234567_1    defaults
libtiff                   4.5.1                h6a678d5_0    defaults
libuuid                   1.41.5               h5eee18b_0    defaults
libwebp-base              1.3.2                h5eee18b_0    defaults
libxcb                    1.15                 h7f8727e_0    defaults
libxkbcommon              1.0.1                h5eee18b_1    defaults
libxml2                   2.10.4               hf1b16e4_1    defaults
lightning-utilities       0.10.1                   pypi_0    pypi
lz4-c                     1.9.4                h6a678d5_0    defaults
markdown                  3.5.2                    pypi_0    pypi
markdown-it-py            3.0.0                    pypi_0    pypi
markupsafe                2.1.5                    pypi_0    pypi
matplotlib                3.8.0            py39h06a4308_0    defaults
matplotlib-base           3.8.0            py39h1128e8f_0    defaults
matplotlib-inline         0.1.6                    pypi_0    pypi
mccabe                    0.7.0                    pypi_0    pypi
mdurl                     0.1.2                    pypi_0    pypi
mkl                       2023.1.0         h213fc3f_46344    defaults
mkl-service               2.4.0            py39h5eee18b_1    defaults
mkl_fft                   1.3.8            py39h5eee18b_0    defaults
mkl_random                1.2.4            py39hdb19cb5_0    defaults
mp-pde-solvers            1.0                       dev_0    <develop>
multidict                 6.0.5                    pypi_0    pypi
multiprocess              0.70.16                  pypi_0    pypi
munch                     4.0.0                    pypi_0    pypi
munkres                   1.1.4                      py_0    defaults
mypy-extensions           1.0.0                    pypi_0    pypi
mysql                     5.7.24               h721c034_2    defaults
ncurses                   6.4                  h6a678d5_0    defaults
nengo                     4.0.0                    pypi_0    pypi
nodeenv                   1.8.0                    pypi_0    pypi
numpy                     1.26.4           py39h5f9d8c6_0    defaults
numpy-base                1.26.4           py39hb5e798b_0    defaults
omegaconf                 2.3.0                    pypi_0    pypi
openjpeg                  2.4.0                h3ad879b_0    defaults
openssl                   3.0.13               h7f8727e_0    defaults
packaging                 23.1             py39h06a4308_0    defaults
pandas                    2.1.4                    pypi_0    pypi
parso                     0.8.3                    pypi_0    pypi
pathspec                  0.12.1                   pypi_0    pypi
pcre2                     10.42                hebb0a14_0    defaults
pexpect                   4.9.0                    pypi_0    pypi
pillow                    10.2.0           py39h5eee18b_0    defaults
pip                       23.3.1           py39h06a4308_0    defaults
platformdirs              4.2.0                    pypi_0    pypi
ply                       3.11             py39h06a4308_0    defaults
pre-commit                3.6.2                    pypi_0    pypi
prompt-toolkit            3.0.43                   pypi_0    pypi
protobuf                  4.25.3                   pypi_0    pypi
psutil                    5.9.8                    pypi_0    pypi
ptyprocess                0.7.0                    pypi_0    pypi
pure-eval                 0.2.2                    pypi_0    pypi
pyarrow                   15.0.1                   pypi_0    pypi
pyarrow-hotfix            0.6                      pypi_0    pypi
pybind11                  2.11.1                   pypi_0    pypi
pycodestyle               2.11.1                   pypi_0    pypi
pydantic                  2.6.4                    pypi_0    pypi
pydantic-core             2.16.3                   pypi_0    pypi
pyflakes                  3.2.0                    pypi_0    pypi
pyg-lib                   0.4.0+pt113cu116          pypi_0    pypi
pygments                  2.17.2                   pypi_0    pypi
pykeops                   2.2.2                    pypi_0    pypi
pyparsing                 3.0.9            py39h06a4308_0    defaults
pyqt                      5.15.10          py39h6a678d5_0    defaults
pyqt5-sip                 12.13.0          py39h5eee18b_0    defaults
python                    3.9.18               h955ad1f_0    defaults
python-dateutil           2.8.2              pyhd3eb1b0_0    defaults
pytorch-lightning         2.0.4                    pypi_0    pypi
pytz                      2024.1                   pypi_0    pypi
pyyaml                    6.0.1                    pypi_0    pypi
qt-main                   5.15.2              h53bd1ea_10    defaults
readline                  8.2                  h5eee18b_0    defaults
regex                     2023.12.25               pypi_0    pypi
requests                  2.31.0                   pypi_0    pypi
rich                      13.7.1                   pypi_0    pypi
safetensors               0.4.2                    pypi_0    pypi
scikit-learn              1.3.0            py39h1128e8f_1    defaults
scipy                     1.11.4           py39h5f9d8c6_0    defaults
seaborn                   0.13.2                   pypi_0    pypi
sentry-sdk                1.41.0                   pypi_0    pypi
setproctitle              1.3.3                    pypi_0    pypi
setuptools                68.2.2           py39h06a4308_0    defaults
sip                       6.7.12           py39h6a678d5_0    defaults
six                       1.16.0             pyhd3eb1b0_1    defaults
smmap                     5.0.1                    pypi_0    pypi
sqlite                    3.41.2               h5eee18b_0    defaults
stack-data                0.6.3                    pypi_0    pypi
structured-kernels        0.1.0                    pypi_0    pypi
sysroot_linux-64          2.17                h57e8cba_10    defaults
tbb                       2021.8.0             hdb19cb5_0    defaults
tensorboard               2.16.2                   pypi_0    pypi
tensorboard-data-server   0.7.2                    pypi_0    pypi
threadpoolctl             2.2.0              pyh0d69192_0    defaults
timm                      0.5.4                    pypi_0    pypi
tk                        8.6.12               h1ccaba5_0    defaults
tokenizers                0.15.2                   pypi_0    pypi
tomli                     2.0.1            py39h06a4308_0    defaults
toolz                     0.12.1                   pypi_0    pypi
torch                     1.13.1+cu116             pypi_0    pypi
torch-cluster             1.6.1+pt113cu116          pypi_0    pypi
torch-geometric           2.3.0                    pypi_0    pypi
torch-scatter             2.1.1+pt113cu116          pypi_0    pypi
torch-sparse              0.6.17+pt113cu116          pypi_0    pypi
torch-spline-conv         1.2.2+pt113cu116          pypi_0    pypi
torchaudio                0.13.1+cu116             pypi_0    pypi
torchmetrics              1.3.1                    pypi_0    pypi
torchtext                 0.14.1                   pypi_0    pypi
torchvision               0.14.1+cu116             pypi_0    pypi
tornado                   6.3.3            py39h5eee18b_0    defaults
tqdm                      4.66.2                   pypi_0    pypi
traitlets                 5.14.2                   pypi_0    pypi
transformers              4.38.2                   pypi_0    pypi
typing-extensions         4.10.0                   pypi_0    pypi
tzdata                    2024.1                   pypi_0    pypi
untokenize                0.1.1                    pypi_0    pypi
urllib3                   2.2.1                    pypi_0    pypi
virtualenv                20.25.1                  pypi_0    pypi
wandb                     0.16.4                   pypi_0    pypi
wcwidth                   0.2.13                   pypi_0    pypi
werkzeug                  3.0.1                    pypi_0    pypi
wheel                     0.41.2           py39h06a4308_0    defaults
widgetsnbextension        4.0.10                   pypi_0    pypi
xxhash                    3.4.1                    pypi_0    pypi
xz                        5.4.6                h5eee18b_0    defaults
yarl                      1.9.4                    pypi_0    pypi
zipp                      3.17.0           py39h06a4308_0    defaults
zlib                      1.2.13               h5eee18b_0    defaults
zstd                      1.5.5                hc292b87_0    defaults

With Ubuntu 22.04.3 LTS (GNU/Linux 6.2.0-33-generic x86_64). Also, cuda_extension is installed successfully. So I don't know if you guys have some ideas. Looking forward to your reply. Thank you!

liluo2 avatar Mar 13 '24 16:03 liluo2