SG-XM issues

Results 7 issues of


                                            SG-XM

fmax 8000,会对模型有什么影响吗

想做小样本学习100样本左右,微调 tacotron 的 decoder 部分 #507 想知道fmax8000的话会对语音的相似度有什么影响吗,另外输出的这个 attention 图代表什么呢,横轴是步数,纵轴是attention,比如下面的这些输出该怎么分析呢,横轴代表步数的话为什么不是递增呢,这个图该怎么看呀万分感谢!!希望可以深入交流 ![attention_step_118000_sample_1](https://user-images.githubusercontent.com/32589854/178210021-248c25be-dddd-4339-967a-3aef06a9dda8.png) ![attention_step_118500_sample_1](https://user-images.githubusercontent.com/32589854/178210052-f09aa349-2974-457c-bb06-07d35988fe82.png) ![attention_step_119000_sample_1](https://user-images.githubusercontent.com/32589854/178210063-d40044de-8cd5-4194-b53d-c87875f579de.png) ![attention_step_119500_sample_1](https://user-images.githubusercontent.com/32589854/178210070-edfcd072-0133-463e-b9cb-a5b84594ccf6.png) ![attention_step_120000_sample_1](https://user-images.githubusercontent.com/32589854/178210086-0f840750-3f82-4ffa-a4ac-7dc260f4d45c.png)

一块 2080ti ,做 finetune 的话速度大概多少 step/sec 比较合理

样本数量大概在100条左右,冻结了tacotron的前面的参数,0.3step的速度合理吗,应该大概在什么数量级呀,以及 loss大概在什么数量级的时候效果会比较好呢

mmselfsup 的数据集只能是图片数据集吗, train pipeline 中只看到 LoadImageFromFile

我如果有其他形式的数据集想用来做 ssl, 以及自定义 pretext task, 要如何适配数据集(比如最经典的IRIS)? 还是说现在的代码设计架构完全不支持其它类型的数据集呢? 感谢您的回答,祝工作顺利,身体健康!

How to init CropSetting

For this case: I use CropWidget opening an image with fixed croparea, and drag imge , scale image, then i need to save the transformer of image. Because I need...

Is crop function lock image object?

I want to crop one Image into several part. so I have some code below: ``` dart List resTask = []; for (int i = 0; i < param.type; i++)...

[Question]: RAG-Graph Generate-Answer with history message in chat api

### Describe your problem "I have a very complex graph model, and in one of the processes, I connect an answer generation component with a human-computer interaction component. In this...

bug

[BUG] 任何一张其他的图片都会在这里报错，是图片格式问题吗，图片美颜功能在iOS上无法使用

![sample_face](https://github.com/pixpark/gpupixel/assets/32589854/251ea8f2-3439-4e8e-b27b-78131036b2b0) ![9ca2bf9e-f9b6-41de-b1cc-a4408c1ec745](https://github.com/pixpark/gpupixel/assets/32589854/898d7e37-fa9a-4e27-8cdc-9bb41b2e1d71) VNN_Apply_Face_CPU 函数报错，在SourceImage create中我跟踪进入，发现图片原始channel count确实是3，不是4，但是这是否会对VNN造成影响？因为我不太熟悉VNN，实在无法解决这个问题。我尝试过用同名png替换项目内的sample_face.png文件仍然存在问题。我使用在线工具对项目内的sample_face.png进行裁剪，放回项目内依旧没有问题，故而排除了像素问题。但我不知道下一步该怎么办，我尝试修改stbi_load让他强制加载4通道，但是似乎没有用，而且还会影响GL纹理绑定。

bug