Vadim Markovtsev comments

Results 259 comments of


                                            Vadim Markovtsev

[PGA] Pointer redeclared during import "unsafe" (borges-indexer on Windows)

Oh and there is another option: SQL interface to the underlying repos via [gitbase](https://github.com/src-d/gitbase)

[PGA] Pointer redeclared during import "unsafe" (borges-indexer on Windows)

As far as I know gitbase+Spark integration is not ready yet. But yep this is the goal. So the only way to run PySpark over siva atm is through jgit-spark-connector...

Update pga.sourced.tech

Redirect @ajnavarro

Update command

I strongly +1 this as a casual user. Please add this!

Question on full PGA integrity verification

@bzz It must be 3TB, not 2.4. Either something went wrong during the download or the index misses some repos. We measured 3TB from our local HDFS copy. This is...

Question on full PGA integrity verification

@campoy I have an impression that we are reinventing a huge wheel here, but I cannot list any particular prior. I note that the Torrent protocol can be handy here:...

Question on full PGA integrity verification

@bzz I would collect the list of file names with sizes and compare it to the list retrieved from the server (you can ask Rafa to run any listing command...

Question on full PGA integrity verification

The number of lines in index matches, the number of siva files should be around 270k. This means 30k were not indexed and it is very, very bad.

Question on full PGA integrity verification

So before moving forward, we need to index the siva files which were discarded.

Question on full PGA integrity verification

@bzz This is great news! I am so happy you failed to download them two times and this is not an indexing issue!