websockets icon indicating copy to clipboard operation
websockets copied to clipboard

Support for custom headers for handshake

Open Misiu opened this issue 7 years ago • 104 comments

Hi,

please consider adding ability to add custom headers for handshake. In RFC6455 there one interesting point:

The request MAY include any other header fields, for example, cookies [RFC6265] and/or authentication-related header fields such as the |Authorization| header field [RFC2616], which are processed according to documents that define them.

I've found an example how to add custom header to handshake: https://blog.heckel.xyz/2014/10/30/http-basic-auth-for-websocket-connections-with-undertow/ but this is for Java and unfortunately isn't possible in HTML5.

When searching over the net I found many places question about this option, for example: https://github.com/sta/websocket-sharp/pull/22 https://stackoverflow.com/questions/4361173/http-headers-in-websockets-client-api/4361358#4361358 https://github.com/aspnet/SignalR/issues/888

For example in Python this is possible https://stackoverflow.com/questions/15381414/sending-custom-headers-in-websocket-handshake. Other languages also support this. Last place missing is the browser.

Please consider adding this into specification. Having this even as a draft would allow us to consider browser vendors to add support for it.

If this is incorrect place for adding request about specification please forgive me and please point me to right place.

Misiu avatar Sep 22 '17 18:09 Misiu

We'd need implementer interest in this before moving forward, similar to https://github.com/whatwg/html/issues/2177.

domenic avatar Sep 23 '17 02:09 domenic

Mozilla - https://support.mozilla.org/en-US/questions/1176810 Chromium - https://bugs.chromium.org/p/chromium/issues/detail?id=768328 EDGE - https://wpdev.uservoice.com/forums/257854-microsoft-edge-developer/suggestions/31600090-websocket-support-for-custom-headers-for-handsha

@domenic I'll add feature request for other browsers soon. Hopefully some of them will show some interest.

Misiu avatar Sep 25 '17 11:09 Misiu

Thank you Misiu for filing the feature request and the bugs for each vendor.

Regarding complexity, for the web platform, this would require non-trivial amount of work at both spec and impl side.

To introduce custom header feature, we need to make it issue a CORS preflight. Though this is not an HTTP API, considering the original reason why we introduced the CORS preflight, we need it for WebSocket + custom headers.

On Chromium, we have separate stacks for WebSocket and XHR/fetch() (for which we have CORS logic). Some non-trivial refactoring and gluing is needed.

tyoshino avatar Sep 26 '17 02:09 tyoshino

I am one of the authors and maintainers of Chrome's WebSocket implementation. I oppose this proposal.

Reasons:

  • The WebSocket handshake security model hinges on exposing no more capabilities for request forgery than are already possible using an img tag. It is not possible to add arbitrary headers to the request using an img tag.
  • Apart from the limited sub-protocol negotiation, the WebSocket handshake is not intended as a means to exchange metadata about the connection. Non-browser implementations may use it this way, but I would not recommend it. It is better to think of it as a low-level operation like a TCP/IP SYN packet.
  • It would be possible to add this to the WebSocket protocol by using a CORS preflight. However, the utility provided simply doesn't justify the additional implementation, standardisation and maintenance cost.

ricea avatar Sep 26 '17 02:09 ricea

@tyoshino @ricea thank You guys for Your reply. My main idea was to add ability to add Authentication header to WebSockets.

If You look at SignalR repo You will notice that many people are looking for a way to pass authentication via WebSocket - https://github.com/aspnet/SignalR/issues/888#issuecomment-346242065

Right now the only option to do this is to add token as part of query string. To simplify this only Authorization header is required.

Misiu avatar Nov 27 '17 12:11 Misiu

@Misiu In the specific case of WebSockets I think passing an authentication token in the URL is okay. The reason is that, unlike HTTP URLs, wss: URLs are never exposed to the user. They can't bookmark them or copy-and-paste them. This minimises the risk of accidental sharing. In addition, their appearance in other web APIs is minimal [1]. For example, they won't appear in history. This reduces the risk of leakage via JS APIs.

The best thing from a security perspective would probably be to perform authentication after the WebSocket is connected, but I realise it is undesirable from a resource-usage perspective to permit unauthorised connections to be established in the first place.

Given the portability difficulties with using cookies or http auth, a short-lived authentication token in the URL is probably your best option at the moment.

A specific note with respect to the Authorization header is that normally you'd need to be able to handle a 401 response to use it. But in the WebSocket API error responses are never exposed to the page for security reasons. So that wouldn't work very well.

[1] I actually can't think of any other APIs that expose ws: or wss: URLs at all. Certainly they don't appear in resource timing.

ricea avatar Nov 27 '17 13:11 ricea

I cannot use WebSocketSharp because it's missing the ability to customize authorization headers. I am trying to integrate with a WebSocket api that takes in OAuth2 bearer tokens.

digitalpacman avatar Mar 15 '18 22:03 digitalpacman

@ricea That's not the main reason why you don't put authentication values in URLs. It's because of server logs. Auth tokens should be treated with the same security concern as username and password combos. You don't put username and password combinations in urls, so neither should auth tokens. If you have a vulnerability and a user downloads your http logs, they then have access to virtually all of your accounts.

digitalpacman avatar Mar 15 '18 22:03 digitalpacman

@ricea sadly implementers don't get to make the decision of what is required to use an api. I am currently trying to use an api that REQUIRES oauth2 bearer tokens in the authorization header for server-to-server communication. They offer cookies for browser based security, but I do not have that luxury because I have no sessionids for the user. So all you're doing by saying "we should not do this" is saying "you have to look elsewhere to implement with this api". Which would simply just drive people away from using this implementation. You aren't "saving" anyone but not allowing this, just alienating users.

digitalpacman avatar Mar 15 '18 22:03 digitalpacman

I encountered this issue looking for solution on my problem - how to remove sid parameter from URL that is set and used by websockets. Now I see @ricea suggestion of adding short living token to URL with explanation that "wss: URLs are never exposed to the user."

OWASP Zap report is suggesting sid param removal from URL, also this question on stackoverflow is refering that CloudFlare is also suggesting sid param removal from URL: https://stackoverflow.com/questions/42759556/how-to-remove-socket-io-sid-parameter-from-url

So my question is, @ricea, do you think that this reported issue is completely irrelevant because of fact that ws/wss urls are never exposed to users?

maleta avatar Apr 11 '18 09:04 maleta

@maleta

So my question is, @ricea, do you think that this reported issue is completely irrelevant because of fact that ws/wss urls are never exposed to users?

Security measures need to be considered in terms of what threats they are intended to protect against and to what extent they are effective in mitigating those threats.

In the case of the Cloudflare vulnerability, placing the sid parameter in a header rather than in the URL would have provided no protection whatsoever.

Quoting https://bugs.chromium.org/p/project-zero/issues/detail?id=1139:

... we observed encryption keys, cookies, passwords, chunks of POST data and even HTTPS requests for other major cloudflare-hosted sites from other users.

This is the trouble with checklist security. If you removed the sid parameter from the URL, you could check that item off the list and feel that you'd improved your security. But in practice there's no improvement at all.

@digitalpacman If someone has access to your server logs, they can do much worse things than just steal credentials. There's usually all kinds of personally-identifiable information in there. Access needs to be tightly restricted just as with any other kind of user data.

ricea avatar Apr 12 '18 11:04 ricea

@ricea there is only personally-identifiable information in there if you aren't doing good practices. At most there should be an IP address, but you can opt to not record that either. Not putting SECURITY CREDENTIALS in the url is an extremely popular best practice. Infact, governing bodies for security will instantly fail you if you are passing them in the URL.

Please look at the OAuth2 spec for example and explanation of these vulnerabilities. There is a reason it REQUIRES them in the POST body.

digitalpacman avatar Apr 12 '18 12:04 digitalpacman

@ricea is this perhaps something we can reconsider if https://wicg.github.io/origin-policy/ becomes a thing? That should nullify most of the CORS preflight cost.

The reason I'm somewhat sympathetic to the requests is because I learned that middleboxes basically ruin full duplex HTTP so WebSocket will likely stick around.

annevk avatar Apr 12 '18 12:04 annevk

@annevk I think origin policy will probably be implemented in the browser in the same place as CORS, so it would still mean re-wiring the handshake to go down the same code path.

The WebSocket handshake started out as purely a mechanism to negotiate a WebSocket connection. Over time we've added HTTP features like cookies and authentication, but we've paid a high price in implementation complexity.

The "right" way to do WebSocket authentication is to do it at the application layer after the handshake completes. No-one asks how to put an oauth token in a TCP/IP SYN packet. But the reality is that we've ended up in a fuzzy middle-ground that is hard to explain.

ricea avatar Apr 12 '18 12:04 ricea

@ricea given that we do add browser-supplied cookies and authentication, it's quite a reasonable request that we also enable headers, since much WebSocket server-side infrastructure probably depends on such information being in the handshake.

And at least as far as the specification is concerned the handshake shares many aspects with Fetch, to enable mixed content blocking, HSTS, CSP, etc. So from that perspective this is not that much of a stretch. (I wonder if Chrome's approach is shared by other browsers.)

annevk avatar Apr 12 '18 13:04 annevk

@ricea

The WebSocket handshake security model hinges on exposing no more capabilities for request forgery than are already possible using an img tag. It is not possible to add arbitrary headers to the request using an img tag.

Can you elaborate here. I don't understand why the security model hinges on not allowing headers to be set in the client API? I'm not sure I understand this.

davidfowl avatar Jun 07 '18 05:06 davidfowl

@davidfowl part of the same-origin policy is that we don't send "attacker-controlled" headers to cross-origin URLs. That's why if you want to do that you need to use CORS, which uses a CORS preflight to make an explicit check that the cross-origin URL is okay with the headers about to be transmitted.

The objection here, at least from the Chrome team, is that supporting a CORS preflight for WebSocket is too costly.

annevk avatar Jun 07 '18 05:06 annevk

@annevk Thanks for that clarity!

So attacker controlled headers are seen as more dangerous than attacker controlled query string. Would it help if we restricted the types of headers that could be sent?

davidfowl avatar Jun 07 '18 06:06 davidfowl

Yes, you can basically reach any arbitrary URL (whatever the query string or path), but you can only control headers to a very limited extent (and methods too, only HEAD/GET/POST).

The request headers we allow "attackers" to control are listed at https://fetch.spec.whatwg.org/#cors-safelisted-request-header, but note that we plan to lock that down some more in https://github.com/whatwg/fetch/pull/736. For the kinds of use cases that people seem to have I don't think allowing these would be sufficient (or it would lead to hacks where you put authorization data in Accept-Language or some such).

annevk avatar Jun 07 '18 06:06 annevk

@annevk The current design forces people to send what would typically go in the the Authorization header in the query string. Is that any less secure than allowing say the authorization header to bet set on the upgrade request? (even cross origin)

davidfowl avatar Jun 07 '18 07:06 davidfowl

The concern is not with the security of the application making the request, it's with the security of the remote server. The remote server will have to be robust against arbitrary URLs, but it does not have to be robust against arbitrary headers, since it'll assume (due to the long history of the web and early establishment of the same-origin policy) that those cannot come from browsers.

annevk avatar Jun 07 '18 07:06 annevk

I see, understand the push back now. So there needs to be a pre-flight request integrated into the flow and that's expensive in chrome because of the current implementation split between websockets and xhr/fetch. Is that a fair summary?

davidfowl avatar Jun 07 '18 08:06 davidfowl

Excellent summary.

annevk avatar Jun 07 '18 08:06 annevk

Should we get some of the other browser implementors to chime in? I can probably get somebody from the edge team to chime in here about the difficulty there. I'm not sure how to go about getting the other browser vendors interested enough to look at this issue (safari, firefox, anything else?)

davidfowl avatar Jun 07 '18 08:06 davidfowl

Maybe @mcmanus for Firefox and @youennf for Safari? Feedback from Edge would be good.

annevk avatar Jun 07 '18 08:06 annevk

I'm kinda surprised people aren't smuggling this through subprotocol - seems obvious enough.

architecturally speaking, a CORS preflight wouldn't be a big burden for firefox to implement.

otoh, designs that result in a lot of preflights suck so I'm not convinced we want to enable this rather than pushing it into the post websocket-handshake data.

mcmanus avatar Jun 07 '18 09:06 mcmanus

rather than pushing it into the post websocket-handshake data.

Then its a per web app custom protocol implementation; open to a slow loris type, slow auth issue from an unknown user; depending on how the application developer implements it; vs a well understood header handling implementation by the webserver?

benaadams avatar Jun 07 '18 09:06 benaadams

@davidfowl Actually yes it is inherently less secure to send it via querystring. This is why the OAuth spec requires the credentials to be passed in headers or body, always. The reasoning is because log files of webservers and routers often log the URL including the querystring parameters for identifying requests. Therefore, without monitoring every system in the entire world yourself you rely on them properly stripping credentials from their logs to protect people. That entire problem disappears when you send sensitive credentials in the headers or body because those are rarely, if ever, logged. Mainly because of their size and the understanding that it is not directly secure. Some headers are logged by webservers, but those would be exclusively called out.

digitalpacman avatar Jun 07 '18 09:06 digitalpacman

@digitalpacman I'm fully aware. I'm mostly trying to get the implementors to agree we should try to solve this problem.

otoh, designs that result in a lot of preflights suck so I'm not convinced we want to enable this rather than pushing it into the post websocket-handshake data.

The problem is that you've left the realm of http auth here and you now have to invent a mini protocol over the websocket itself. You also need to let the "attacker" successfully establish a connection to your server, and you now have to wait for data to be sent over it (which needs to timeout appropriately) to authenticate. So you end up with completely different authentication methods for websocket and non-websockets clients.

davidfowl avatar Jun 07 '18 16:06 davidfowl

@davidfowl I've given up the fight. I think we should fork, fix, and ignore this repository for life. I physically switched from c# to nodejs because of this. I don't get a choice in how a websocket server I'm integrating with does it's authing protocol. It could be custom, it could be headers, it could be url, it could be in a postflight check, or preflight check. A libraries job is not to dictate HOW integrations must occur, unless it's implementing a strict standard like OAuth2. The Websocket spec specifies header authing is available, so it should be available.

I physically cannot use this library because of the resistance of the owner. The library I am integrating with requires that a preflight OAuth2 http request occurs to receive a token that is passed within a Bearer authorization header. I get no choice in the matter.

digitalpacman avatar Jun 07 '18 17:06 digitalpacman