Mayur Bhosale issues

Results 13 issues of


                                            Mayur Bhosale

EventLoss analyzer for detecting event loss

Loss of critical events like StageEnd, JobEnd, ExecutorAdded, etc leads to inaccurate reports. https://github.com/qubole/sparklens/issues/56 highlights this problem. We should write an EventLossDetector Analyzer which will detect the event loss and...

Report driver memory statistics with QuboleJobListener

Following metrics related to the Driver are now reported - 1. driverHeapMax => Max Heap memory allocated to the driver JVM 2. driverMaxHeapCommitted => Max Heap memory committed to the...

fix for exception while running sparklens with event history file in spark 2.4.0 and above.

- Sparklens is compiled against spark 2.0.0 - Spark 2.0.0 has a dependency on json4s 3.2.11 wherein Spark 2.4.0 onwards json4s 3.5.3 is used - In the later version the...

Stages nodes

- Added a framework for simplifying adding any metrics to Sparklens report - Mapping the nodes of the sparkplan to the stages they were executed as a part of -...

Auto refreshing Access token

As a part of this [change](https://github.com/GoogleCloudDataproc/spark-bigquery-connector/pull/146) we have added support for access token-based authorization. But considering the Access token has a short expiry (50 mins from generation), this becomes an...

[WIP] Unsafe shuffle writer support in RSS

**Key traits** - Stores the map output data in serialized form - Buffers the data in memory as much as possible. Chunk the data before sending it to RSS servers....

[WIP] Add tolerance in RSS cluster for server going away

Adds fault tolerance in RSS servers for one or more server going away. This is how the functionality works - Node/server goes away - Task reading/writing data from that server...

[Proposal] Unsafe memory management in RSS mappers

Mappers in RSS send shuffle data for any given partition to a single RSS servers, so that reducers can read the shuffle data from a single location. To incorporate this,...

FetchNode does not fetch a any links from the webpage

**Describe the bug** FetchNode currently only fetches the static html content from the page and does not fetch any links. Without that multi-level scrapping won't be possible **To Reproduce** ```...

enhancement

fix: Augment the information getting fetched from a webpage

These are follow-up changes from the discussion https://github.com/VinciGit00/Scrapegraph-ai/issues/187 We are now adding a mechanism to fetch the contents of the webpage using beautifulsoup. Apart from the header and body are...