Julien
Julien
**Describe the issue** Our tests are done on old alert data, with no diversity (all stars... or unknown). We should push new alerts with different classes (at least one representative...
Recent PR (https://github.com/astrolabsoftware/spark3D/pull/124) contains a bug fix that needs to be available in the Maven central repository. Action item: make a new release.
All is in the title: should we develop the package in Scala or Python? Scala: * Pros * Easy to work with Spark (native) * Geospark is in Java/Scala *...
*pyspark3d* issue. kNN search for data set size > 2G elements seem to go crazy :D I was running kNN for k=1000, and data set size = 5,000,000,000 elements. ```...
Idea 1 (fixed clustering): - load raw data - perform a k-means where k = number of partitions - repartition accordingly Idea 2 (dynamic clustering): - Load raw data -...
First steps logbook: #6 Week 5: - Added Octree Partitioned RDD support #36 - Add SphereRDD #38 - Code refactoring: Replace Sphere by ShellEnvelope, Shape3D heritage, and move everything under...
That would be good to have this feature... For the record: https://tools.ietf.org/pdf/rfc7946.pdf
OS: CentOS Linux release 7.4.1708 (Core) spark3D: 0.1.4 spark-fits: 0.6.0 #72 adds a script to benchmark the partitioning. The idea is the following: 1) Load data using spark-fits (10 millions)...
Would be good to have this feature... Syntax would be something like ```scala // Load the data as Shape3DRDD[Point3D] val objectRDD: Shape3DRDD[Point3D] = ... // Build the indices based on...
I came across [smile](https://github.com/haifengl/smile). Seems very promising! Examples are in preparation.