IFC4.3.x-development icon indicating copy to clipboard operation
IFC4.3.x-development copied to clipboard

BINARY data is not optimal

Open I-Sokolov opened this issue 2 years ago • 6 comments

I am working with user request to support texture and was surprised (if I understood correct) EXPRESS ISO 10303-11-2000 and SDAI ISO 10303-24-2001 define to store BINARY data as text string like "%0101001100" - character per bit

136 binary_literal = '% ' bit { bit } . 119 bit = '0' | '1'

Is it really good way to store textures? -:)

Can we change to base 64 like we have for GUIDs?

I-Sokolov avatar Jul 08 '22 16:07 I-Sokolov

I have looked into IFC4 example and confused. IFCBLOBTEXTURE(.T.,.T.,'TEXTURE',#58,$,'PNG',"089504E470D0.,,

According EXPRESS ISO 10303-11-2000 this is a text string in Universal Multiple-Octet Coded Character Set (UCS) and can not be interpreted as binary data

https://standards.buildingsmart.org/IFC/RELEASE/IFC4/ADD2_TC1/HTML/annex/annex-e/tessellation-with-blob-texture.ifc

I-Sokolov avatar Jul 08 '22 16:07 I-Sokolov

The p21 representation of binary as as follows:

HEX               = "0" | "1" | "2" | "3" | "4" | "5" | "6" | "7" |
                    "8" | "9" | "A" | "B" | "C" | "D" | "E" | "F" .
BINARY            = """" ( "0" | "1" | "2" | "3" ) { HEX } """" .

So it's a quoted (double quotes) hexadecimal string prefixed with [0-4] to store the remainder of integer division of the number of bits divided by four. That allows to differentiate between 0 -> 30 and 0000 -> 00.

So it's not as bad as the definition of literals in p11 (the schema) but in p21 (the model exchange) it's still a little bit worse than hexadecimal.

To me base64 does make a lot of sense for this.

My proposal would be to deprecate blob texture (because there is no way we can backwards-compatibility make these changes) but rather why not use IfcImageTexture for this purpose in the same way as glTF (and most modern web-based standards) with "data" URIs.

Something like the following:

...

In my view this is a completely backwards compatible usage of IfcImageTexture that's more efficient and inline with other modern graphic exchange formats.

aothms avatar Jul 09 '22 06:07 aothms

@aothms, Thanks for clarification. It means I've looked into old standard.

I-Sokolov avatar Jul 11 '22 15:07 I-Sokolov

Yes, proposal for base64 still actual

I-Sokolov avatar Jul 11 '22 15:07 I-Sokolov

My proposal would be to deprecate blob texture (because there is no way we can backwards-compatibility make these changes)

Generally, we can recognize is this hex-string by double-quotes like "07BF" or single-quoted string with other encoding ''

However I do not know how to specify it in EXPRESS:) Something like select but should not use type path when serialize.

I-Sokolov avatar Jul 12 '22 08:07 I-Sokolov

or single-quoted string with other encoding ''

However I do not know how to specify it in EXPRESS:)

I think that's the "beauty" of using the existing IfcImageTexture with IfcURIReference. Currently it states that IfcURIReference is based on RFC 3986, it seems to me that we should just make a documentation change to also include RFC 2397 https://datatracker.ietf.org/doc/html/rfc2397 So there isn't anything needed on the schema level.

aothms avatar Jul 12 '22 08:07 aothms