lokvke
> Hi @lokvke, thank you very much for your interest in CVCUDA! > > Could you share some information on your use case? E.g., would you like to get a ROI...
> Hi @lokvke, > > then you could use the padandstack operator, which takes an ImageBatchVarShape as input and outputs a Tensor of the cropped size. Below is a...
> Another question: can I convert an ImageBatchVarShape to nvcv.Tensor or torch.Tensor directly? As far as I know, I have to process the images one by one. +1
@yulj21 May I ask whether the May pretrained model provided by the author does not support synthesizing Chinese audio?
In the inject_blink_to_lm68 function, when the generated video contains 676 frames, T=676. So when i=675 and j=1, idx=676, which is out of range. Here is my solution: **idx = i % (i +...
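For anyone hitting the same out-of-range index, here is a minimal sketch of one way to guard it. It assumes idx is computed as i + j (consistent with i=675, j=1 giving 676) and clamps the result to the last valid frame. The helper name `clamp_frame_index` is hypothetical and this is not the truncated fix quoted above, just an alternative.

```python
def clamp_frame_index(i: int, j: int, T: int) -> int:
    """Return i + j, clamped to the last valid frame index T - 1.

    Assumption: the index is formed as i + j and valid indices are 0..T-1.
    """
    return min(i + j, T - 1)


T = 676  # number of frames in the generated video
idx = clamp_frame_index(675, 1, T)  # stays inside the valid range 0..675
```

Clamping repeats the last frame instead of wrapping around to the start of the sequence, which may or may not be what you want for a blink pattern.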
eye_blink_dim: 2 in lm3d_radnerf_sr.yaml, but eye_blink_dim: 4 in lm3d_radnerf_torso_sr.yaml
After 250000 training steps, the torso part is still rendered only with the head model. This didn't happen in the geneface project.
The synthesized audio for short sentences is garbled. Has this been solved?
How are GaussianTalker's speed and quality?
When using internvl2-26b to extract category names from images, repeated output also occurs occasionally, as shown in the figure: