jgit-spark-connector
jgit-spark-connector copied to clipboard
jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
## Expected Behavior I have been accessing siva files and get WARN messages. The question is whether it happens due to corrupted data or tools? Are these warnings expected to...
classifyLanguages uses enry and return human readable name while bblfsh uses different one. Signed-off-by: Maxim Sukharev
Your issue may already be reported! Please search on the [issue track](../) before creating one. ## Expected Behavior ## Current Behavior 18/10/17 13:19:18 WARN Repository: close() called when useCnt is...
The best way to get people to try your technology is to reduce time to first "whoa" moment. In order to do so @eiso created a Dockerfile that allows you...
This PR fixes a simple bug I found when running the notebook on the engine docker image. Basically, the command: ``` engine = Engine(spark, "/repositories", "standard") ``` Failed with the...
This feature request is a followup on initial discussion (focused on correctness initially) started by @EgorBu https://github.com/src-d/engine/issues/165 Engine already handles Bblfsh errors well. Now the idea is to somehow expose...
When running on local mode with `--packages "tech.sourced:engine:0.6.3"` - extracting UASTs works. But after switching to actual Apache Spark cluster with the same params and query i.e in Standalone mode...
Right now, if there is an index filter all history is traversed and then filtered. We could skip that altogether and just don't traverse the history if we have a...
## Expected Behavior The following query should not crash: ```sql -- Repository count per language precense. -- If any file in language X is present in HEAD of a repository,...
## Expected Behavior When I get the join optimization error (i.e. `This Join could not be optimized. This might severely impact the performance...`) I should be able to read the...