David A Roberts
> The most reliable way is going to be to add the content offline. Even the tar add command will hit the file descriptor issue :/ @whyrusleeping I am, still...
There's still a little bit of work that needs to be done, but I'm going to go ahead and call this the first complete (pdf+src+metadata) release of the arXiv archive:...
@NDuma GitXiv looks really cool, thanks! cc @samim23 @mekarpeles
@rht I'm not sure there is a script (yet), it was mostly a (semi-)manual process... :/
Ok, I think I'm going to have to put this on hold until we get more storage nodes. PubMed is absolutely massive. **The open access subset alone is currently weighing...
On hold until `ipfs add` performance improves; there are too many small files.
@eminence awesome, thanks for tackling this and writing up the details :) Cc @whyrusleeping @rht @diasdavid
> Note that I'd love a mode in ipfs add that just gives me the top-level hash of the thing that I'm adding, so I can dispense with the tail...
@rht What percentage of the total runtime is currently consumed by the hash function?
@jbenet is there anything actionable here? If not, it might make sense to move these links to the wiki