gemini
gemini copied to clipboard
Perf: measure performance of file-based similarity
Umbrella issue for adding perf measurements to every command for file-based similarity:
- Add CLI flag for ./query ./report to print time for each stage
- Instrument FE, exposing one endpoint \w json
- Instrument Apache Spark hashing job using
org.apache.spark.groupon.metrics.UserMetricsSystemto expose to Spark JSON endpoint