so-vits-svc
[mps] issue with Apple silicon compatibility
OS version
Darwin arm64
GPU
mps
Python version
Python 3.8.16
PyTorch version
2.0.0
Branch of sovits
4.0(Default)
Dataset source (Used to judge the dataset quality)
N/A
Where the problem occurs or what command you executed
inference
Situation description
Tips:
- use `PYTORCH_ENABLE_MPS_FALLBACK=1`
- use `-d mps`
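Putting the two tips together, a run on Apple silicon might look like the following. The model/config paths and speaker name are placeholders; the `-d mps` flag follows the tip above, and the other flags are assumed to match the 4.0 branch's `inference_main.py`:

```shell
# Enable CPU fallback for ops not implemented on the MPS backend
# (e.g. some double-precision methods), then run inference on MPS.
# All paths and names below are placeholders.
PYTORCH_ENABLE_MPS_FALLBACK=1 python inference_main.py \
    -m logs/44k/G_0.pth \
    -c configs/config.json \
    -n input.wav \
    -t 0 \
    -s speaker0 \
    -d mps
```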
issues:
- [x] f0_mean_pooling #142
- [ ] enhance
related codes:
https://github.com/svc-develop-team/so-vits-svc/blob/0298cd448fc699732e29d8951c96bc02dcc347ce/vdecoder/nsf_hifigan/models.py#L144-L146
https://github.com/svc-develop-team/so-vits-svc/blob/0298cd448fc699732e29d8951c96bc02dcc347ce/vdecoder/nsf_hifigan/models.py#L159-L162
There are some `double` type casts in the source code. Are they required? Some methods related to `double` are not implemented on MPS devices. I think `float` is enough, but I am not sure.
I have modified and tested locally, and it works well.
Is there a significant loss of precision in moving the `torch.cumsum` operation from `double` to `float`?
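As a rough, self-contained check of the precision question (using NumPy instead of PyTorch, and a random signal in [0, 1] standing in for normalized F0 — both assumptions), one can compare a long cumulative sum in single vs. double precision:

```python
import numpy as np

# Accumulate a normalized-F0-like signal over many frames, mimicking the
# phase accumulation that torch.cumsum performs in the NSF source module.
n = 100_000
rng = np.random.default_rng(0)
f0 = rng.random(n)  # values in [0, 1), stored as float64

phase64 = np.cumsum(f0)                     # double-precision reference
phase32 = np.cumsum(f0.astype(np.float32))  # single-precision run

abs_err = np.abs(phase64 - phase32.astype(np.float64))
rel_err = abs_err[-1] / phase64[-1]
print(f"max absolute error after {n} steps: {abs_err.max():.3e}")
print(f"relative error at the last step: {rel_err:.3e}")
```

On this synthetic input the relative drift after 100k frames stays small, which is consistent with the claim that `float` may suffice, though it does not prove anything about audible artifacts in the actual vocoder.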
CC: @ylzz1997
Log
N/A
Supplementary description
No response
In theory, it doesn't matter. F0 normalized to 0-1 should be accurate to about 1e-7, which `float` just covers.
But the cast from `float` to `double` before the cumsum operation was written by the author of NSF-HiFiGAN. What its exact effect is would be a question to raise as an issue under the NSF-HiFiGAN project.