Sihan Chen
                                        Results
                                        3
                                        repositories owned by
                                        
                                
                                            Sihan Chen
                                        
                                    VAST
                            
                                235
                            
                            
                        
                        Stars
                    
                            
                                15
                            
                            
                        
                        Forks
                    Watchers
                    Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
COSA
                            
                                38
                            
                            
                        
                        Stars
                    
                            
                                2
                            
                            
                        
                        Forks
                    Watchers
                    Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
VALOR
                            
                                259
                            
                            
                        
                        Stars
                    
                            
                                15
                            
                            
                        
                        Forks
                    Watchers
                    Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset