ic-http URI scheme for addressing content of canisters via their http interface
ic-http URI Scheme For Addressing Content of Canisters Exposed Via Their HTTP Interface
Introduction
Canister smart contracts on ICP can expose an HTTP interface that can be accessed from an HTTP client like a regular browser via an HTTP gateway. The HTTP Gateway Protocol Specification mandates regular HTTP URIs to specify a resource of a canister. That is, the HTTP URI for a canister contains the boundary node as the authority, or host name. This is inflexible as the URI contains a specific boundary node host name node and multiple different URIs with different boundary node hostnames refer to the same resource.
Example: <canister_id>.ic0.app/path/to/resource?par1=abcd&par2=efgh
Boundary node host names should be transparent for the addressing of HTTP resources. This standard introduces a new ic-http URI scheme which abstracts from the host name of the boundary node and thereby allows for uniquely referring to HTTP-exposed resources on the Internet Computer without reference to the boundary node's host name. This is crucial because with the opening up of the Internet Computer's edge infrastructure by allowing community-operated boundary nodes, there will be many host names available to access the same resource.
For example, an NFT standard can use this mechanism to specify in NFT metadata HTTP-based canister-hosted resources of NFTs. Parties who read the metadata can transform the ic-http URIs to HTTP URIs with their preferred boundary node provider and load the resources with those URIs.
This standard does not specify how to resolve the http-ic URI to a specific HTTP URI that can be accessed in a browser. This is left to future standards that are expected to evolve once the boundary node ecosystem is growing with further community participation. Such future standard would need to compose an HTTP URI from an http-ic URI based on the boundary node provider preference, and based on the preference choose a hostname.
Note: not all boundary node providers will provide access to each canister. Thus, this constrains the transformation function from http-ic URIs to HTTP URIs.
Background
RFC-3986 specifies the syntax and semantics of URIs. The following grammar defines all valid URIs:
URI = scheme ":" hier-part [ "?" query ] [ "#" fragment ]
hier-part = "//" authority path-abempty
/ path-absolute
/ path-rootless
/ path-empty
For a new URI scheme, a subset of the language defined by this grammar needs to be defined.
Specification
The "ic-http" scheme is used to locate blockchain network resources exposed by ICP canisters via the HTTP semantics for IC-HTTP URIs independently of the hostname of the boundary nodes via which the resources can be accessed.
URI schemes are lowercase according to RFC-3986, Section 3.1.
The ic-http scheme defined in this standard has the following grammar expressed following the formal language put forth in RFC-5234:
IC-HTTP-URI = "ic-http" ":" ic-http-hier-part [ "?" query ] [ "#" fragment ]
ic-http-hier-part = "//" ic-canister-principal ic-http-path
ic-canister-principal = 9*9( 5*5( base32char ) "-" ) 1*2( base32char )
/ 9*9( 5*5( base32char ) )
/ 1*8( 5*5( base32char ) "-" ) ( 1*5( base32char ) )
/ 1*5( base32char )
ic-http-path = path-abempty ; begins with "/" or is empty
/ path-absolute ; begins with "/" but not "//"
/ path-empty ; zero characters
path-abempty = *( "/" segment )
path-absolute = "/" [ segment-nz *( "/" segment ) ]
path-empty = 0<pchar>
segment = *pchar
segment-nz = 1*pchar
segment-nz-nc = 1*( unreserved / pct-encoded / sub-delims / "@" )
; non-zero-length segment without any colon ":"
query = *( pchar / "/" / "?" )
fragment = *( pchar / "/" / "?" )
unreserved = ALPHA / DIGIT / "-" / "." / "_" / "~"
pct-encoded = "%" HEXDIG HEXDIG
sub-delims = "!" / "$" / "&" / "'" / "(" / ")"
/ "*" / "+" / "," / ";" / "="
pchar = unreserved / pct-encoded / sub-delims / ":" / "@"
base32char = base32letterlowercase / base32letterlowercase / base32digit
base32letterlowercase = a-z
base32letteruppercase = A-Z
base32digit = 2-7
We refer the reader to RFC-3986 for details on the components of a URI and RFC-2234 for the definition of the productions in the above grammar that are not defined explicitly, such as ALPHA or DIGIT.
Examples
The following is an example for a regular HTTP URI
https://by6od-j4aaa-aaaaa-qaadq-cai.ic0.app/token-1234/image?format=jpeg&res=high&hash=1234...abcd&hash_algorithm=sha256
The same URI expressed using the approach using an ic-http URI is the following:
ic-http://by6od-j4aaa-aaaaa-qaadq-cai/token-1234/image?format=jpeg&res=high&hash=1234...abcd&hash_algorithm=sha256
FIX further examples to show all the features
IANA Registration
The current proposal has not yet been registered with IANA's URI scheme registry to make it an official URI scheme. This is a future step that should be taken once the approach has been sufficiently validated in reference implementations and found to be fit for the intended purpose.
Conclusions
With the boundary node edge infrastructure being opened up to community participants, additional boundary nodes with their own host names are expected to emerge. This ICRC standard is intended to unify the addressing of HTTP-exposed resources of canister smart contracts on the Internet Computer. It abstracts away the hostname of the boundary node used so far in HTTP URIs to address resources. This is important for any use case that references HTTP-exposed canister resources.
References
URI: https://datatracker.ietf.org/doc/html/rfc3986 Syntax: https://datatracker.ietf.org/doc/html/rfc5234 IANA URI scheme registry: https://www.iana.org/assignments/uri-schemes Internet Computer HTTP Gateway Protocol Specification: https://internetcomputer.org/docs/current/references/http-gateway-protocol-spec Augmented BNF for Syntax Specifications: ABNF: https://datatracker.ietf.org/doc/html/rfc2234
Some questions that we still need to discuss:
- Do we need to handle
httpsin this spec? What we define here is really theic-httpscheme, which is clearly independent of TLS, as is thehttpscheme. TLS is a layer on top. But still, this would lead to the URI scheme beingic-httpsif we do it in analogy tohttp. Or we just defineic-https, which includes security TLS out of the box. There will be no non-securedic-httpprotocol available anyway. - The mapping from
httpURIs used currently toic-httpURIs needs to be defined at some time. This will involved some form of directory of boundary nodes and information on which canisters are reachable through which boundary nodes. None of this infrastructure is in place currently, but needs to be to support a growing set of community-based boundary node providers.
Please find the PR for the working draft here: https://github.com/dfinity/ICRC/compare/icrc-91-working-draft