datacuration topic
List of datacuration repositories

scholia
212 stars, 77 forks
Wikidata-based scholarly profiles

library
180 stars, 6 forks
70+ CLI tools to build, browse, and blend your media library. An index for your archive.

data-prep-kit
235 stars, 122 forks
Open source project for data preparation for LLM application builders

NeMo-Curator
542 stars, 71 forks
Scalable data preprocessing and curation toolkit for LLMs