dblp issues

config is not recognized while executing the code

5

I am facing problem while running pipeline.py, as i am getting problem in config module

paper.csv is too large to save in my computer

1

When I tried to run the pipeline, paper.csv was generated from Miner-Papertxt (about 2.2G). And the paper.csv file was too large (exceeded 1.7T) but my computer has only about 2T...

rebecca312

help wanted

Repdocs Module Documentation

Could you please add descriptions for each file in the repdocs module. I'm trying to use this parser for my projects and am unclear what all the files contain and...

psombe

help wanted

documentation

Author <id> to <name> mapping

2

I was able to run the whole project. But i am not sure to from where do i get author id to author name mapping ?

prakhar21

documentation

Add complete() mechanism to BuildDataset

It appears that `Task`s with no output are supposed to implement a custom `complete()` method, since completion normally means all the output files exist. We should either make the outputs...

macks22

Perform author name disambiguation to produce new mapping

2

From the data, it appears the AMiner group did not perform any name disambiguation. This has led to a dataset with quite a few duplicate author records. This package currently...

macks22

enhancement

help wanted

Build co-authorship network

While the AMiner group already has a co-authorship network provided, it unfortunately does not allow for filtering by year ranges, which is a key feature of this library. Therefore it...

macks22

enhancement

Unit tests for each v1.0 Task

Should rely on a small portion of the real dataset that is representative in order to test.

macks22

enhancement

Tasks to summarize data

For a complete dataset, generate a summary of salient characteristics, such as: - number of nodes and edges for each graph, diameter, avg. degree - number of documents, terms, and...

macks22

enhancement

Thoroughly document each Task

macks22

documentation

dblp
dblp copied to clipboard

Metadata

config is not recognized while executing the code

paper.csv is too large to save in my computer

Repdocs Module Documentation

Author <id> to <name> mapping

Add complete() mechanism to BuildDataset

Perform author name disambiguation to produce new mapping

Build co-authorship network

Unit tests for each v1.0 Task

Tasks to summarize data

Thoroughly document each Task

← Metadata

Owner

Metadata

dblp dblp copied to clipboard

Metadata

← Metadata

Owner

Metadata

dblp
dblp copied to clipboard