runtime icon indicating copy to clipboard operation
runtime copied to clipboard

[QUIC] Long running test: System.Net.Quic.Tests.QuicStreamConformanceTests.Parallel_ReadWriteMultipleStreamsConcurrently

Open rzikm opened this issue 3 years ago • 3 comments

Occurrences 7/17-8/7 (incl. PRs):

Day Run
7/31 2x Rolling run
7/29 Rolling run
7/28 2x Rolling run
7/27 Rolling run
7/26 Rolling run
7/25 Rolling run
7/24 3x Rolling run
7/23 Rolling run
7/22 4x Rolling run + 2x PR #72670
7/18 3x Rolling run
7/17 3x Rolling run
before 7/16 Logs not available anymore
let timeouts = (friendlyNamePrefix : string, includePR : bool) {
cluster('engsrvprod.kusto.windows.net').database('engineeringdata').WorkItems
| where Status == "Timeout"
| where FriendlyName startswith friendlyNamePrefix
//| where Finished > datetime('2021-09-07')
| distinct JobId, WorkItemId, Name, FriendlyName, ConsoleUri
| join kind=inner (cluster('engsrvprod.kusto.windows.net').database('engineeringdata').Jobs
    | where ((Branch == 'refs/heads/main') or (Branch == 'refs/heads/master') or (includePR and (Source startswith "pr/")) or (Branch startswith 'refs/heads/release/6.0'))
    | where Type startswith "test/functional/cli/"
        and not(Properties contains "runtime-staging")
    | summarize arg_max(Finished, Properties, Type, Branch, Source, Started, QueueName) by JobId
| project-rename JobType = Type) on JobId
| extend PropertiesJson = parse_json(Properties)
| extend OS = replace_regex(tostring(PropertiesJson.operatingSystem), @'\((.*)\).*|([^\(].*)', @'\1\2')
| extend Runtime = iif(PropertiesJson.runtimeFlavor == "mono", "Mono", iif(PropertiesJson.DefinitionName contains "coreclr", "CoreCLR", ""))
| extend TargetBranch = extractjson("$.['System.PullRequest.TargetBranch']", Properties)
| extend Architecture = PropertiesJson.architecture
| extend Scenario = iif(isempty(PropertiesJson.scenario), "--", PropertiesJson.scenario)
| project-away PropertiesJson
| order by Finished desc
};
timeouts('System.Net.Quic', true);

Example occurrences: https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-1b78cb6570b34ab2b4/System.Net.Quic.Functional.Tests/1/console.f63e2e91.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-ef18348474c247e1ac/System.Net.Quic.Functional.Tests/1/console.ff884bf6.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-6c3d3e6a75d84c1f96/System.Net.Quic.Functional.Tests/1/console.c15ed8bb.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-290c3d971e8e4f11b9/System.Net.Quic.Functional.Tests/1/console.b8248e3f.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-46fba0508c4a4e3aab/System.Net.Quic.Functional.Tests/1/console.78a1ce4b.log?helixlogtype=result

Happens about 1-2 per day, and seems to have been happening for some time. Not constrained to a specific OS.

rzikm avatar Aug 04 '22 14:08 rzikm

Tagging subscribers to this area: @dotnet/ncl See info in area-owners.md if you want to be subscribed.

Issue Details

Example occurences: https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-1b78cb6570b34ab2b4/System.Net.Quic.Functional.Tests/1/console.f63e2e91.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-ef18348474c247e1ac/System.Net.Quic.Functional.Tests/1/console.ff884bf6.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-6c3d3e6a75d84c1f96/System.Net.Quic.Functional.Tests/1/console.c15ed8bb.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-4225a705f77145e2bb/System.Net.Mail.Functional.Tests/1/console.3e3899f3.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-290c3d971e8e4f11b9/System.Net.Quic.Functional.Tests/1/console.b8248e3f.log?helixlogtype=result https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-heads-main-46fba0508c4a4e3aab/System.Net.Quic.Functional.Tests/1/console.78a1ce4b.log?helixlogtype=result

Happens about 1-2 per day, and seems to have been happening for some time. Not constrained to a specific OS.

Author: rzikm
Assignees: -
Labels:

area-System.Net.Quic

Milestone: -

msftbot[bot] avatar Aug 04 '22 14:08 msftbot[bot]

Triage: We should disable the test due to higher impact on CI. We should figure out why it is happening in 7.0. Perhaps tweak the test to fail instead of hanging.

karelz avatar Aug 04 '22 16:08 karelz

Test disabled in #73446

karelz avatar Aug 07 '22 14:08 karelz

@ManickaP has a local repro, investigating now

karelz avatar Aug 11 '22 15:08 karelz

Based on info from @ManickaP -- it is a product bug. Await gets stuck when GC collects QuicListener. It just looks like deadlock, though it is not. Still pretty ugly for customers.

karelz avatar Aug 22 '22 12:08 karelz

Reopeing to track fix in 7.0 in PR #74740.

karelz avatar Aug 29 '22 15:08 karelz

Fixed in main (8.0) in PR #74450 and in 7.0 (for RC2) in PR #74740.

karelz avatar Aug 30 '22 13:08 karelz