dblp
dblp copied to clipboard
Tasks to summarize data
For a complete dataset, generate a summary of salient characteristics, such as:
- number of nodes and edges for each graph, diameter, avg. degree
- number of documents, terms, and nonzeros in each corpus, quantiles on term count
- proportion of papers with abstracts
- ground truth stats: # venues, quantiles on comm. size