performance-analyzer-rca
performance-analyzer-rca copied to clipboard
The Performance Analyzer RCA is a framework that builds on the Performance Analyzer engine to support root cause analysis (RCA) of performance and reliability problems for Elasticsearch instances.
While setting up a dev environment for Open Distro, one of my nodes has substantial log spam of the following: ``` Sep 24 12:16:15 masternode1 performance-analyzer-agent-cli[26382]: 12:16:15.689 [grpc-default-executor-21] ERROR com.amazon.opendistro.elasticsearch.performanceanalyzer.rca.net.SubscribeResponseHandler...
The following test failed when building PA-RCA: ``` com.amazon.opendistro.elasticsearch.performanceanalyzer.rca.integTests.tests.jvmsizing.HeapSizeIncreaseMissingMetricsTest Test testMissingHeapMetrics FAILED (3m 36s) java.lang.AssertionError: Values should be different. Actual: HeapSizeIncreaseAction ```
The following test failed when building PA-RCA: ``` com.amazon.opendistro.elasticsearch.performanceanalyzer.rca.integTests.tests.consolidate_tuning.JvmFlipFlopITest Test testJvmActions FAILED The Runner found the following errors in log: [ 23:11:59.827 [ELECTED_MASTER-task-1-] ERROR NodeConfigCacheReaderUtil:readQueueCapacity()::line:40 - Exception while reading queue...
OSMetricsCollector#collectMetrics() is responsible for persisting Thread related info in 'jTidMap' In case the 'jTidMap' has not been updated and threadState is null, we return -1 when [getNativeThreadId](https://github.com/opendistro-for-elasticsearch/performance-analyzer-rca/blob/3749a2668f2947df68cfc4817eba3af0ad9b6fb3/src/main/java/com/amazon/opendistro/elasticsearch/performanceanalyzer/metrics/ThreadIDUtil.java#L30)() is invoked. We...
Hello! I've just upgrade my ODFE cluster and now i'm seeing a lot of these erros regarding to performance analyzer. Is there any workaround for this? Thanks in advance! performance-analyzer-agent-cli:...
We've added admission control metrics as part of https://github.com/opendistro-for-elasticsearch/performance-analyzer-rca/issues/570 Those metrics and other cluster metrics would be used to fine-tune AdmissionControl threshold for various controllers. --- **ENABLE/DISABLE SETTINGS:** 1. Ability...
*Fixes #:* *Description of changes:* TotalError and ReaderRestartProcessing will stop showing in service log entry counter if the value is 0 Add default MethodName CollectStatsMetric when MethodName is not provided...
This Metric will Publish the Only Election term number *Tests:* Docker Container Testing *1. Schema of Election term* ``` sqlite> .schema Election_Term CREATE TABLE Election_Term(sum double null, avg double null,...
Master Pending Queue size per task type Earlier we were publishing Total number of pending task. For RCA analysis Pending Queue size per task type will give better understanding. Changed...
The changes are made in accordance of how we are getting fault detection metrics from ES. Fault Detection metrics in ES will be published in a manner similar to ingest...