PowerShellGallery icon indicating copy to clipboard operation
PowerShellGallery copied to clipboard

PowerShell Gallery Extremely Slow Publishing and Access

Open boblodgett opened this issue 1 year ago • 24 comments

Prerequisites

  • [X] Write a descriptive title.
  • [X] Make sure you are able to repro it on the latest version
  • [X] Search the existing issues.

Steps to reproduce

Starting within the last 24 hours PowerShell Gallery has become extremely slow. I do not see anything on the status page yet but there should be.

Expected behavior

Working with normal speed.

Actual behavior

It is extremely slow.

Error details

No response

Environment data

N/A

Version

N/A

Visuals

No response

boblodgett avatar May 16 '24 01:05 boblodgett

@boblodgett Thanks for letting us know, we've been doing some deployments which may have caused a bit of latency, though it shouldn't be that extreme. Do you know when exactly this started happening or just that it was roughly 24 hours ago? Things should be stable in terms of deployments, but please keep us updated if this issue continues or resolves. We'll investigate and see if we can pinpoint what's going.

alerickson avatar May 16 '24 02:05 alerickson

@alerickson our metrics show that the slowness with FindPackagesById() API call started around 2024-05-15 17:10 UTC and was about 6 times slower than the baseline and then started to get significantly better from 2024-05-16 00:40 UTC. Its still not as good as it used to be, about 40% slower than the baseline.

afroz429 avatar May 16 '24 03:05 afroz429

@alerickson appears to be closer to baseline from 2024-05-16 15:10 UTC

afroz429 avatar May 16 '24 16:05 afroz429

...we've been doing some deployments which may have caused a bit of latency,...

@alerickson how much latency was added to the system? Should we be seeing job steps that are going from 12 minutes up to 60+ minutes for downloading modules to our agents in Azure DevOps Service (use Azure VMSS deployed agents so our steps to install dependent modules has to run in every run).

wsmelton avatar May 20 '24 14:05 wsmelton

@alerickson - All of our builds are now failing/timing out when we use Invoke-PSDepend. Something is now broken.....

carpnick avatar May 20 '24 14:05 carpnick

@alerickson @SteveL-MSFT We are unable to publish modules at the moment. Are you folks working on a fix?

tksrc avatar May 20 '24 15:05 tksrc

@SydneyhSmith - FYI - serious performance issues with Powershell gallery again. Has been this way since late last week.

carpnick avatar May 20 '24 18:05 carpnick

Making some changes this morning and monitoring-- hopefully should be improving shortly

SydneyhSmith avatar May 20 '24 18:05 SydneyhSmith

Discovered this post after noticing that PowerShell modules which used to update in seconds, are now taking 5-30+ minutes. Still haven't seen Microsoft.Graph finish installing. Perhaps a load balancing, server I/O, or software timeout issue on the server side? Good luck with the fix!

sterlingtalbott avatar May 20 '24 21:05 sterlingtalbott

Performance is still very bad - 1 day later.

carpnick avatar May 21 '24 13:05 carpnick

Same for us. It was "OK" briefly in the morning (UK time) and then it went super slow again.

tksrc avatar May 21 '24 13:05 tksrc

I have been trying to install Azure Powershell for over 24 hours with no success.

image

JasonGore avatar May 21 '24 15:05 JasonGore

our pipelines have been experiencing the slowness (causing failures) since about Noon Central Time yesterday. It seems to happen in waves, and will occur for an hour or two, and then things work, and then later another hour or two of failures.

scottmcburney avatar May 21 '24 15:05 scottmcburney

Thanks for the continued updates, our monitors are showing that the Gallery is healthy at this point, after we have made updates, but we are doing deeper investigation based on these reports

SydneyhSmith avatar May 21 '24 16:05 SydneyhSmith

I've had a ticket open with the graph team since yesterday. It was recommended that I drop the ticket number here -

2*0007771

We've had pipelines fail with End of Central Directory could not be found. We pretty much build everything on demand for our infrastructure and deployments so we're completely down when these things timeout.

It seems like its a bit better today but there is still some huge delays. Any updates on what you're seeing on your end?

image

crownmike avatar May 21 '24 16:05 crownmike

Our monitors show it is still working very slowly. These problems started shortly before I opened this ticket and have not completely recovered since.

image

The requests are timing out for us:

https://www.powershellgallery.com/api/v2/FindPackagesById()?id='AWS.Tools.AWSSupport'&$inlinecount=allpages&$filter=IsLatestVersion%20and%20Id%20eq%20'AWS.Tools.AWSSupport'. Attempt 1 of 3. 
{
    "Url": "https://www.powershellgallery.com/api/v2/FindPackagesById()?id='AWS.Tools.AWSSupport'&$inlinecount=allpages&$filter=IsLatestVersion%20and%20Id%20eq%20'AWS.Tools.AWSSupport'",
    "Retry": 1,
    "MaxRetries": 3,
    "Exception": {
        "ClassName": "System.Threading.Tasks.TaskCanceledException",
        "Message": "The request was canceled due to the configured HttpClient.Timeout of 30 seconds elapsing.",

boblodgett avatar May 21 '24 17:05 boblodgett

We are also seeing issues across all of our environments with all different types of modules. Sometimes gets an error, sometimes it never completes and is just suspended. Have not seen this issue before

Kambz23 avatar May 21 '24 17:05 Kambz23

We just finished another deployment-- let me know if you are seeing any improvement

SydneyhSmith avatar May 21 '24 18:05 SydneyhSmith

What were the causes of the issues?

goldjg avatar May 21 '24 19:05 goldjg

The initial cause points to high load on the service--- we are still determining the full series of events as to why the service did not scale as expected

SydneyhSmith avatar May 21 '24 19:05 SydneyhSmith

The initial cause points to high load on the service--- we are still determining the full series of events as to why the service did not scale as expected

It does seem a lot more performant right now.

Install of Az module was much faster.

Now testing install of Microsoft.Graph module which as well as being slow was also giving Package failed to install because: End of Central Directory record could not be found. earlier.

goldjg avatar May 21 '24 20:05 goldjg

Yeah definitely a lot better!


VERBOSE: Module 'Microsoft.Graph' was installed successfully to path '/home/prometheus/.local/share/powershell/Modules/Microsoft.Graph/2.19.0'.
PS /home/prometheus>

goldjg avatar May 21 '24 20:05 goldjg

It has currently improved to normal levels:

image

boblodgett avatar May 21 '24 20:05 boblodgett

Performance is much better on my end is well.

carpnick avatar May 23 '24 22:05 carpnick