[flagd-ui] Spans with error
Changes
- Switch flagd-ui navbar links from plain anchors to LiveView navigation (.link navigate) to avoid full page reloads and websocket teardown when switching between Basic and Advanced.
- Fix LiveSocket URL construction under the /feature base path so the websocket connects to /feature/live (was /featurelive), preventing spurious disconnects.
Merge Requirements
For new feature contributions, please make sure you have completed the following essential items:
- [x] CHANGELOG.md updated to document new feature additions
- [x] Appropriate documentation updates in the docs
- [x] Appropriate Helm chart updates in the helm-charts
Maintainers will not merge until the above have been completed. If you're unsure which docs need to be changed, ping @open-telemetry/demo-approvers.
The committers listed above are authorized under a signed CLA.
- :white_check_mark: login: julianocosta89 / name: Juliano Costa (34f66abb2cfe781b7d0e575f972f1847ce863109)
@jack5341 thx for that!
I still see an error when accessing the flagd-ui service though:
This error happens every time I open /feature.
Did you guys face something like that before in this repository?
Not only the CSS but also the JS files are unreachable, so I can’t make any function calls. Even when I switch to another branch, I still encounter the same problem. Yesterday everything was working fine; this issue started today.
Additionally, I use OrbStack for Docker.
My console
GET http://localhost:32935/feature/assets/css/app-6f5d86242cf5220b8531adc7351da8bc.css?vsn=d net::ERR_ABORTED 404 (Not Found)
(index):10
GET http://localhost:32935/feature/assets/js/app-fb088dfe3c12b4ebb739348d1a2a3a57.js?vsn=d net::ERR_ABORTED 404 (Not Found)
Did you guys face something like that before in this repository?
Not really, and today I was able to run your branch fine.
I had to access the /feature to get the traces with error, and everything worked fine
Do you have any idea what could cause this problem?
To help other users see that issue, make sure the project is always started with the make start command. I am on my way again.
Hey @julianocosta89, I’ve fixed it. The issue was caused by the protocol switch during WebSocket connections: HTTP 101 responses were being treated as errors, even though that status is expected for WebSockets. The simple and reliable fix was to add a filter to the otel-collector processors that sets the span status to UNSET for spans with http.status_code 101.
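For reference, a minimal sketch of what such a rule could look like with the collector's transform processor and OTTL; the processor name, its placement, and the exact attribute key are assumptions here rather than the PR's actual contents:

```yaml
processors:
  transform:
    error_mode: ignore
    trace_statements:
      - context: span
        statements:
          # HTTP 101 (Switching Protocols) is the expected reply to a WebSocket
          # upgrade, so clear the error status recorded for those spans.
          - set(status.code, STATUS_CODE_UNSET) where attributes["http.status_code"] == 101
```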
update
Hey @jack5341, now I'm not sure if we should fix this on the demo or if we should open an issue on the instrumentation repo. If 101 shouldn't be an error, the instrumentation is wrong and should be fixed.
WDYT?!
I’d suggest fixing this in the instrumentation repo as well as here, since we don’t know when a fix there would be released or when it would reach this repository.
In that case we should do the following:
- [ ] Open an issue in the Elixir instrumentation repo
- [ ] Add a comment with the link to the issue in the Collector rule here
- [ ] Limit the scope of the rule to only Flagd-UI spans (at the moment the rule is configured to update ALL spans with status code 101)
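A hedged sketch of what the second and third items could end up looking like in the collector config; the service.name value, the comment wording, and the issue link placeholder are assumptions, not the actual rule from this PR:

```yaml
processors:
  transform:
    trace_statements:
      - context: span
        statements:
          # Workaround: the Elixir instrumentation records HTTP 101 (WebSocket
          # upgrade) as an error. Remove once the upstream issue is resolved.
          # TODO: link the Elixir instrumentation issue here once it is opened.
          - set(status.code, STATUS_CODE_UNSET) where attributes["http.status_code"] == 101 and resource.attributes["service.name"] == "flagd-ui"
```

Scoping on resource.attributes["service.name"] keeps 101 responses from every other service untouched, which is the point of the third item.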
1- Is this repository the correct one for Elixir?
2- Do you mean I should add a comment to the changes here and include a link to the issue created in the Elixir repository?
3- Why not keep it global? I mean, 101 shouldn’t really be treated as an error, am I wrong?
1- Is this repository the correct one for Elixir?
yes
2- Do you mean I should add a comment to the changes here and include a link to the issue created in the Elixir repository?
Yes, same as we have done here: https://github.com/open-telemetry/opentelemetry-demo/blob/main/src/otel-collector/otelcol-config.yml#L150
3- Why not keep it global? I mean, 101 shouldn’t really be treated as an error, am I wrong?
Good question. I wonder if other instrumentations set the span status to error as well. If so, then this is a specification problem.
I can take a further look at that next week.
This PR was marked stale due to lack of activity. It will be closed in 7 days.
3- Why not keep it global? I mean, 101 shouldn’t really be treated as an error, am I wrong?
I've tried to find the reasoning behind 101 being treated as an error, but I couldn't find anything.
https://opentelemetry.io/docs/specs/semconv/general/recording-errors/#what-constitutes-an-error
@tsloughter according to @jack5341 https://github.com/open-telemetry/opentelemetry-demo/pull/2677#issuecomment-3443325727:
The issue was caused by the protocol switch during WebSocket connections. HTTP 101 responses were being treated as errors, which is actually expected behavior for WebSockets.
Is there any reason why 101 would be an error in this context?
In OpenTelemetry pipelines it may appear as one because of how certain instrumentations interpret “non-final responses” or “upgrade responses” during the HTTP upgrade.
This PR was marked stale due to lack of activity. It will be closed in 7 days.
Bump.
The error span is actually from Envoy, right? What is the current status on this? Is something going wrong with how the Elixir service is closing or not closing the websocket?
I think the issue is that the flagd-ui is not actually closing the websocket in a timely manner.
This can be found in the network tab of the browser:
Envoy tags it as an error because it didn't get a response from flagd-ui, so I still believe the error is on the Elixir side of things.
Do you know how it gets asked to close it? Is it supposed to time out after not having more flags requested for a period and close or is the client ending the connection?
Do you know how it gets asked to close it? Is it supposed to time out after not having more flags requested for a period and close or is the client ending the connection?
Not really, this is handled by Envoy, right?!
The websocket will stay open until a timeout expires or it is told to close, so I'm wondering whether this is a case of the client closing, a timeout expected to go off on the server side, or Envoy having a timeout after which it closes websockets.
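For illustration, a hedged sketch of the Envoy knobs being discussed; the route match, cluster name, and timeout value are assumptions and are not taken from the demo's actual frontend-proxy config:

```yaml
route_config:
  virtual_hosts:
    - name: frontend
      domains: ["*"]
      routes:
        - match: { prefix: "/feature" }
          route:
            cluster: flagd-ui
            upgrade_configs:
              - upgrade_type: websocket   # allow the HTTP 101 upgrade through to flagd-ui
            idle_timeout: 300s            # Envoy closes the upgraded stream after 5 minutes with no data
```

If a route-level (or connection-manager-level) idle_timeout like this is in effect, Envoy ends the websocket itself once no frames flow, which would match the "Envoy could have a timeout" theory.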
This PR was marked stale due to lack of activity. It will be closed in 7 days.
Closed as inactive. Feel free to reopen if this PR is still being worked on.