Cydral

Results: 29 comments by Cydral

@xhuvom As Davis explained, there is nothing special about training this type of model... except a lot of effort, of course, to build a database that is as clean...

Hi, no problem. I don't know if your database is shareable but if that's the case, I'm interested in improving the model prepared by my team and now available to...

@mowshon I'm really glad to see that the models I've released are being put to use... I am working on improving the model's gender detection for certain ethnic groups...

FYI, I'm trying to work on version 2 of the model, with better database balancing... and a fine-tuning capability for those who want it.

> Have you looked at https://arxiv.org/pdf/2202.08906 and https://arxiv.org/pdf/2308.00951

Yes, I'm familiar with these mechanisms, although strict implementation would be greatly facilitated by support for dynamic network building, which is not typically...

Of course, the shared example is just that—an example. For a more robust MoE implementation, we would likely need to add things like a Gaussian noise layer to improve the...
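
To make the Gaussian-noise idea a bit more concrete, here is a minimal sketch in plain C++. It is not Dlib code and not the implementation from the shared example: `noisy_gate`, `noise_stddev`, and the `training` flag are hypothetical names chosen for illustration. It assumes the usual noisy-gating setup from the MoE literature, where zero-mean Gaussian noise is added to the router logits during training only, before the softmax that produces the expert weights.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <random>
#include <vector>

// Numerically stable softmax over a vector of gate logits.
std::vector<double> softmax(const std::vector<double>& logits) {
    assert(!logits.empty());
    const double max_logit = *std::max_element(logits.begin(), logits.end());
    std::vector<double> probs(logits.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < logits.size(); ++i) {
        probs[i] = std::exp(logits[i] - max_logit);  // shift to avoid overflow
        sum += probs[i];
    }
    for (double& p : probs) p /= sum;
    return probs;
}

// Hypothetical helper (not a Dlib API): perturbs the gate logits with
// zero-mean Gaussian noise while training, then normalizes with softmax.
// At inference time (training == false) the logits are used unchanged.
std::vector<double> noisy_gate(const std::vector<double>& logits,
                               double noise_stddev,
                               bool training,
                               std::mt19937& rng) {
    std::vector<double> perturbed = logits;
    if (training && noise_stddev > 0.0) {
        std::normal_distribution<double> noise(0.0, noise_stddev);
        for (double& v : perturbed) v += noise(rng);
    }
    return softmax(perturbed);
}
```

With `training` set to `false` the function reduces to an ordinary softmax gate, so inference stays deterministic. In a real network the noise would sit in front of the gating layer inside the model definition, not in a free function like this one.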

Despite these constraints, with the recent modifications, it is now possible to build networks in Dlib following the most "modern" architectures, whether for text sequence processing (LM) or image processing...

For now, I’ll stay focused on wrapping up the current work: finalizing the "Static MoE" example, then moving straight to a "Dynamic MoE" example (a draft is available [here](https://github.com/Cydral/dlib-models-training/blob/main/slm_advanced_train_ex2.cpp) for...

No worries at all about the late merge, Davis! Since we're here, I wanted to ask you about something I've been encountering with the example. While it works quite well...

> Ha yeah, same here as the other PR. This ready for review? It's conflicted with master. Maybe they are all ready for review but just need to be merged...