[BUG]: Installing from source fails, and pip install fails too
🐛 Describe the bug
(python3.10) [ColossalAI]BUILD_EXT=1 pip install .
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Processing /home/alsc/ColossalAI
Preparing metadata (setup.py) ... done
Requirement already satisfied: numpy in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (1.22.4)
Requirement already satisfied: tqdm in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (4.66.2)
Requirement already satisfied: psutil in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (5.9.8)
Requirement already satisfied: packaging in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (24.0)
Requirement already satisfied: pre-commit in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (3.6.2)
Requirement already satisfied: rich in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (13.7.1)
Requirement already satisfied: click in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (8.1.7)
Requirement already satisfied: fabric in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (3.2.2)
Requirement already satisfied: contexttimer in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (0.3.3)
Requirement already satisfied: ninja in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (1.11.1.1)
Requirement already satisfied: torch>=1.12 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (2.0.1)
Requirement already satisfied: safetensors in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (0.4.2)
Requirement already satisfied: einops in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (0.7.0)
Requirement already satisfied: pydantic in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (2.6.3)
Requirement already satisfied: ray in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (2.9.3)
Requirement already satisfied: sentencepiece in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (0.2.0)
Requirement already satisfied: google in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (3.0.0)
Requirement already satisfied: protobuf in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from colossalai==0.3.6) (4.25.3)
Requirement already satisfied: filelock in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from torch>=1.12->colossalai==0.3.6) (3.13.1)
Requirement already satisfied: typing-extensions in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from torch>=1.12->colossalai==0.3.6) (4.10.0)
Requirement already satisfied: sympy in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from torch>=1.12->colossalai==0.3.6) (1.12)
Requirement already satisfied: networkx in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from torch>=1.12->colossalai==0.3.6) (3.1)
Requirement already satisfied: jinja2 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from torch>=1.12->colossalai==0.3.6) (3.1.3)
Requirement already satisfied: invoke>=2.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from fabric->colossalai==0.3.6) (2.2.0)
Requirement already satisfied: paramiko>=2.4 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from fabric->colossalai==0.3.6) (3.4.0)
Requirement already satisfied: decorator>=5 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from fabric->colossalai==0.3.6) (5.1.1)
Requirement already satisfied: deprecated>=1.2 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from fabric->colossalai==0.3.6) (1.2.14)
Requirement already satisfied: beautifulsoup4 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from google->colossalai==0.3.6) (4.12.3)
Requirement already satisfied: cfgv>=2.0.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pre-commit->colossalai==0.3.6) (3.4.0)
Requirement already satisfied: identify>=1.0.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pre-commit->colossalai==0.3.6) (2.5.35)
Requirement already satisfied: nodeenv>=0.11.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pre-commit->colossalai==0.3.6) (1.8.0)
Requirement already satisfied: pyyaml>=5.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pre-commit->colossalai==0.3.6) (6.0.1)
Requirement already satisfied: virtualenv>=20.10.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pre-commit->colossalai==0.3.6) (20.25.1)
Requirement already satisfied: annotated-types>=0.4.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pydantic->colossalai==0.3.6) (0.6.0)
Requirement already satisfied: pydantic-core==2.16.3 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from pydantic->colossalai==0.3.6) (2.16.3)
Requirement already satisfied: jsonschema in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from ray->colossalai==0.3.6) (4.21.1)
Requirement already satisfied: msgpack<2.0.0,>=1.0.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from ray->colossalai==0.3.6) (1.0.8)
Requirement already satisfied: aiosignal in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from ray->colossalai==0.3.6) (1.3.1)
Requirement already satisfied: frozenlist in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from ray->colossalai==0.3.6) (1.4.1)
Requirement already satisfied: requests in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from ray->colossalai==0.3.6) (2.31.0)
Requirement already satisfied: markdown-it-py>=2.2.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from rich->colossalai==0.3.6) (3.0.0)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from rich->colossalai==0.3.6) (2.17.2)
Requirement already satisfied: wrapt<2,>=1.10 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from deprecated>=1.2->fabric->colossalai==0.3.6) (1.16.0)
Requirement already satisfied: mdurl~=0.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from markdown-it-py>=2.2.0->rich->colossalai==0.3.6) (0.1.2)
Requirement already satisfied: setuptools in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from nodeenv>=0.11.1->pre-commit->colossalai==0.3.6) (68.2.2)
Requirement already satisfied: bcrypt>=3.2 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from paramiko>=2.4->fabric->colossalai==0.3.6) (4.1.2)
Requirement already satisfied: cryptography>=3.3 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from paramiko>=2.4->fabric->colossalai==0.3.6) (42.0.5)
Requirement already satisfied: pynacl>=1.5 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from paramiko>=2.4->fabric->colossalai==0.3.6) (1.5.0)
Requirement already satisfied: distlib<1,>=0.3.7 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from virtualenv>=20.10.0->pre-commit->colossalai==0.3.6) (0.3.8)
Requirement already satisfied: platformdirs<5,>=3.9.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from virtualenv>=20.10.0->pre-commit->colossalai==0.3.6) (4.2.0)
Requirement already satisfied: soupsieve>1.2 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from beautifulsoup4->google->colossalai==0.3.6) (2.5)
Requirement already satisfied: MarkupSafe>=2.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from jinja2->torch>=1.12->colossalai==0.3.6) (2.1.1)
Requirement already satisfied: attrs>=22.2.0 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from jsonschema->ray->colossalai==0.3.6) (23.2.0)
Requirement already satisfied: jsonschema-specifications>=2023.03.6 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from jsonschema->ray->colossalai==0.3.6) (2023.12.1)
Requirement already satisfied: referencing>=0.28.4 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from jsonschema->ray->colossalai==0.3.6) (0.33.0)
Requirement already satisfied: rpds-py>=0.7.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from jsonschema->ray->colossalai==0.3.6) (0.18.0)
Requirement already satisfied: charset-normalizer<4,>=2 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from requests->ray->colossalai==0.3.6) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from requests->ray->colossalai==0.3.6) (3.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from requests->ray->colossalai==0.3.6) (2.1.0)
Requirement already satisfied: certifi>=2017.4.17 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from requests->ray->colossalai==0.3.6) (2024.2.2)
Requirement already satisfied: mpmath>=0.19 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from sympy->torch>=1.12->colossalai==0.3.6) (1.3.0)
Requirement already satisfied: cffi>=1.12 in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from cryptography>=3.3->paramiko>=2.4->fabric->colossalai==0.3.6) (1.16.0)
Requirement already satisfied: pycparser in /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages (from cffi>=1.12->cryptography>=3.3->paramiko>=2.4->fabric->colossalai==0.3.6) (2.21)
Building wheels for collected packages: colossalai
Building wheel for colossalai (setup.py) ... /
running build_ext
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py:388: UserWarning: The detected CUDA version (11.0) has a minor version mismatch with the version that was used to compile PyTorch (11.8). Most likely this shouldn't be a problem.
warnings.warn(CUDA_MISMATCH_WARN.format(cuda_str_version, torch.version.cuda))
building 'colossalai.C.cpu_adam_x86' extension
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc
creating /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda
Emitting ninja build file /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/1] c++ -MMD -MF /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.o.d -pthread -B /home/alsc/anaconda3/envs/python3.10/compiler_compat -Wno-unused-result -Wsign-compare -DNDEBUG -g -fwrapv -O2 -Wall -fPIC -O2 -isystem /home/alsc/anaconda3/envs/python3.10/include -fPIC -O2 -isystem /home/alsc/anaconda3/envs/python3.10/include -fPIC -I/home/alsc/ColossalAI/extensions/csrc/includes -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/TH -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/include/python3.10 -c -c /home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.cpp -o /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.o -O3 -DVERSION_GE_1_1 -DVERSION_GE_1_3 -DVERSION_GE_1_5 -std=c++14 -std=c++17 -lcudart -lcublas -g -Wno-reorder -fopenmp -march=native -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=cpu_adam_x86 -D_GLIBCXX_USE_CXX11_ABI=0
/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.cpp:237: warning: ignoring #pragma unroll [-Wunknown-pragmas]
237 | #pragma unroll 4
|
/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.cpp:352: warning: ignoring #pragma unroll [-Wunknown-pragmas]
352 | #pragma unroll 8
|
In file included from /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/Exceptions.h:14,
from /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include/torch/python.h:11,
from /home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/extension.h:6,
from /home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.h:29,
from /home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.cpp:22:
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/pybind11/pybind11.h: In instantiation of ‘class pybind11::class_<Adam_Optimizer>’:
/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.cpp:443:51: required from here
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/pybind11/pybind11.h:1479:7: warning: ‘pybind11::class_<Adam_Optimizer>’ declared with greater visibility than its base ‘pybind11::detail::generic_type’ [-Wattributes]
1479 | class class_ : public detail::generic_type {
| ^~~~~~
g++ -pthread -B /home/alsc/anaconda3/envs/python3.10/compiler_compat -shared -Wl,-rpath,/home/alsc/anaconda3/envs/python3.10/lib -Wl,-rpath-link,/home/alsc/anaconda3/envs/python3.10/lib -L/home/alsc/anaconda3/envs/python3.10/lib -Wl,-rpath,/home/alsc/anaconda3/envs/python3.10/lib -Wl,-rpath-link,/home/alsc/anaconda3/envs/python3.10/lib -L/home/alsc/anaconda3/envs/python3.10/lib /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/cpu_adam.o -L/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/lib -L/usr/local/cuda/lib64 -lc10 -ltorch -ltorch_cpu -ltorch_python -lcudart -lc10_cuda -ltorch_cuda -o build/lib.linux-x86_64-cpython-310/colossalai/C/cpu_adam_x86.cpython-310-x86_64-linux-gnu.so
building 'colossalai.C.layernorm_cuda' extension
Emitting ninja build file /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/2] /usr/local/cuda/bin/nvcc -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/TH -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/include/python3.10 -c -c /home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda_kernel.cu -o /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O3 --use_fast_math -maxrregcount=50 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_61,code=sm_61 -gencode arch=compute_70,code=sm_70 -DVERSION_GE_1_1 -DVERSION_GE_1_3 -DVERSION_GE_1_5 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=layernorm_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++17
FAILED: /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda_kernel.o
/usr/local/cuda/bin/nvcc -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/TH -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/include/python3.10 -c -c /home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda_kernel.cu -o /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O3 --use_fast_math -maxrregcount=50 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_61,code=sm_61 -gencode arch=compute_70,code=sm_70 -DVERSION_GE_1_1 -DVERSION_GE_1_3 -DVERSION_GE_1_5 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=layernorm_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++17
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/c10/util/irange.h(54): warning: pointless comparison of unsigned integer with zero
detected during:
instantiation of "__nv_bool c10::detail::integer_iterator<I, one_sided,
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/c10/util/irange.h(54): warning: pointless comparison of unsigned integer with zero
detected during:
instantiation of "__nv_bool c10::detail::integer_iterator<I, one_sided, <unnamed>>::operator==(const c10::detail::integer_iterator<I, one_sided, <unnamed>> &) const [with I=std::size_t, one_sided=true, <unnamed>=0]"
(61): here
instantiation of "__nv_bool c10::detail::integer_iterator<I, one_sided, <unnamed>>::operator!=(const c10::detail::integer_iterator<I, one_sided, <unnamed>> &) const [with I=std::size_t, one_sided=true, <unnamed>=0]"
/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/ATen/core/qualified_name.h(73): here
nvcc error : 'cicc' died due to signal 11 (Invalid memory reference)
[2/2] c++ -MMD -MF /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda.o.d -pthread -B /home/alsc/anaconda3/envs/python3.10/compiler_compat -Wno-unused-result -Wsign-compare -DNDEBUG -g -fwrapv -O2 -Wall -fPIC -O2 -isystem /home/alsc/anaconda3/envs/python3.10/include -fPIC -O2 -isystem /home/alsc/anaconda3/envs/python3.10/include -fPIC -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/TH -I/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/alsc/anaconda3/envs/python3.10/include/python3.10 -c -c /home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda.cpp -o /home/alsc/ColossalAI/build/temp.linux-x86_64-cpython-310/home/alsc/ColossalAI/extensions/csrc/cuda/layer_norm_cuda.o -O3 -DVERSION_GE_1_1 -DVERSION_GE_1_3 -DVERSION_GE_1_5 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=layernorm_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++17
ninja: build stopped: subcommand failed.
Traceback (most recent call last):
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py", line 1893, in _run_ninja_build
subprocess.run(
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/subprocess.py", line 524, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "<string>", line 2, in <module>
File "<pip-setuptools-caller>", line 34, in <module>
File "/home/alsc/ColossalAI/setup.py", line 100, in <module>
setup(
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/__init__.py", line 103, in setup
return distutils.core.setup(**attrs)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 185, in setup
return run_commands(dist)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 201, in run_commands
dist.run_commands()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 969, in run_commands
self.run_command(cmd)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/dist.py", line 989, in run_command
super().run_command(command)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
cmd_obj.run()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/wheel/bdist_wheel.py", line 364, in run
self.run_command("build")
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 318, in run_command
self.distribution.run_command(command)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/dist.py", line 989, in run_command
super().run_command(command)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
cmd_obj.run()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/command/build.py", line 131, in run
self.run_command(cmd_name)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 318, in run_command
self.distribution.run_command(command)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/dist.py", line 989, in run_command
super().run_command(command)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
cmd_obj.run()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/command/build_ext.py", line 88, in run
_build_ext.run(self)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 345, in run
self.build_extensions()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py", line 843, in build_extensions
build_ext.build_extensions(self)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 467, in build_extensions
self._build_extensions_serial()
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 493, in _build_extensions_serial
self.build_extension(ext)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/command/build_ext.py", line 249, in build_extension
_build_ext.build_extension(self, ext)
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 548, in build_extension
objects = self.compiler.compile(
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py", line 658, in unix_wrap_ninja_compile
_write_ninja_file_and_compile_objects(
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py", line 1574, in _write_ninja_file_and_compile_objects
_run_ninja_build(
File "/home/alsc/anaconda3/envs/python3.10/lib/python3.10/site-packages/torch/utils/cpp_extension.py", line 1909, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error compiling objects for extension
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for colossalai
Running setup.py clean for colossalai
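For reference, the warning near the top of the log says the detected CUDA toolkit (11.0) differs from the CUDA version PyTorch was compiled with (11.8). Below is a minimal sketch for confirming both versions on a given machine. It assumes `nvcc` is on `PATH`, which may not be the same binary as the `/usr/local/cuda/bin/nvcc` the build invoked, and the helper names (`parse_nvcc_release`, `parse_version`, `compare_cuda`) are hypothetical. Whether this mismatch is related to the `cicc` crash is not established here.

```python
import re
import shutil
import subprocess


def parse_nvcc_release(nvcc_output: str):
    """Return (major, minor) from `nvcc --version` text, or None if absent."""
    match = re.search(r"release (\d+)\.(\d+)", nvcc_output)
    return (int(match.group(1)), int(match.group(2))) if match else None


def parse_version(version: str):
    """Turn a version string such as '11.8' into the tuple (11, 8)."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def compare_cuda(toolkit, torch_cuda):
    """Classify how the toolkit version relates to PyTorch's CUDA build."""
    if toolkit == torch_cuda:
        return "match"
    if toolkit[0] == torch_cuda[0]:
        return "minor mismatch"
    return "major mismatch"


if __name__ == "__main__":
    nvcc = shutil.which("nvcc")  # assumption: the nvcc on PATH is the one of interest
    if nvcc is None:
        print("nvcc not found on PATH")
    else:
        out = subprocess.run([nvcc, "--version"], capture_output=True, text=True).stdout
        toolkit = parse_nvcc_release(out)
        print("toolkit CUDA:", toolkit)
        try:
            import torch  # optional; only needed for the comparison
        except ImportError:
            print("torch is not installed")
        else:
            # torch.version.cuda is None for CPU-only builds
            if toolkit and torch.version.cuda:
                print("torch CUDA:", torch.version.cuda)
                print(compare_cuda(toolkit, parse_version(torch.version.cuda)))
```

With the versions reported in the log above, `compare_cuda((11, 0), (11, 8))` yields `"minor mismatch"`, which corresponds to the `UserWarning` emitted by `cpp_extension.py`.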
Environment
(python3.10) [alsc@ColossalAI]$ conda list
# packages in environment at /home/alsc/anaconda3/envs/python3.10:
# Name Version Build Channel
_libgcc_mutex 0.1 main defaults
absl-py 2.1.0