RFC: Add blob feature

eibach opened this issue 1 year ago • 14 comments

This adds the ability to trace the status of arbitrary blobs in memory.

This is an RFC only, to see if I am thinking in the right direction. The encoding parameter is meant to allow the profiler to decode the blobs later on. Documentation and timeline integration are still completely missing.

eibach avatar Dec 03 '24 18:12 eibach

What is the intended use case?

Have you tried sending "large" blobs, i.e. larger than enum { TargetFrameSize = 256 * 1024 } bytes?

wolfpld avatar Dec 03 '24 18:12 wolfpld

My use case is recording structs or packets on the fly so I can inspect them later. The intended size is well below 64 KB, with 64 KB as the upper limit.

eibach avatar Dec 03 '24 18:12 eibach

I'm not sure about imposing a limit. You may want instead to send the 3D scene geometry for debug purposes, and that could be easily in the megabytes range. (I actually did such a thing: https://bitbucket.org/wolfpld/tracy/commits/branch/mesh-debug)

wolfpld avatar Dec 03 '24 18:12 wolfpld

Ok, I see. I used the string infrastructure because I thought that identical blobs should only be stored and transferred once. Do you think that would also make sense for bigger blobs, like the geometry from your use case?

eibach avatar Dec 03 '24 18:12 eibach

Sorry, I don't remember the exact details of how things are transferred, so I can't comment on the specifics.

Hashing things for storage shouldn't be a problem, you can get tens of GB/s of hashing perf e.g. with xxh3.

Hashing things for transfer looks like a no-go for me, as you'd need to do it on the client, and doing extra calculations when you are profiling and want to keep impact minimal is something you'd generally want to avoid.

wolfpld avatar Dec 03 '24 19:12 wolfpld

Ah yes, you are right, hashing is only done on the server side. I got that mixed up. I will check whether I hit any limits when I crank up the blob size.

eibach avatar Dec 03 '24 19:12 eibach

It works only up to 64 KB - 1 bytes using the string infrastructure. So maybe I have to come up with a different transfer strategy...

eibach avatar Dec 03 '24 19:12 eibach

Frame image transfers are another example, but they are still limited to ~256 KB. Some sort of partial data transfer would be needed, something like what Worker::QueryDataTransfer() is doing, but in reverse.

(For some reason I'm not able to add a comment on GitHub.)

wolfpld avatar Dec 03 '24 19:12 wolfpld

Alright, I will see if I can get some inspiration from Worker::QueryDataTransfer()

eibach avatar Dec 03 '24 20:12 eibach

I am struggling with the implementation of Worker::QueryDataTransfer(). Why is everything divided into pieces of up to 12 bytes instead of larger chunks? Doesn't that generate a lot of overhead? I am probably missing the point here.

eibach avatar Dec 09 '24 19:12 eibach

Originally the query was 1 byte type + 8 bytes pointer, and that was enough. Then some queries needed more space, so an additional 4 bytes were added, which are unused in all other queries. Splitting larger data into 12-byte chunks is a result of retrofitting this system for, well, larger data.

Yes, it's very inefficient, but the only things being transferred this way are source file paths that the client may have available, so it's not a big deal.

While the implementation can be improved, there are special precautions that need to be taken for things to work properly. The client-to-server data pipe is basically not limited in any way, as it requires high bandwidth. The kernel will send a certain number of packets over the wire, and then wait for a confirmation of retrieval before continuing. Now, if the receiving end also wants to send a large amount of data, it will never read what was received, and these confirmations will never be delivered, resulting in a deadlock. The fixed size of server query packets is an implementation detail that makes it easier to calculate how much data can be sent before a read must occur.

Note that this is only a consideration for server-to-client communication. The memory blobs you want to send are from client to server, and the much larger available size limit (256 KB) is another unrelated implementation detail.

wolfpld avatar Dec 09 '24 20:12 wolfpld

This is my idea for the client side. If you agree, I would start implementing the server side.

eibach avatar Dec 09 '24 22:12 eibach

I have added support for fragmented transfer of blob data. Now uint32_t should be the limit ;)

eibach avatar Dec 16 '24 21:12 eibach

Why is m_serverQuerySpaceLeft increased? The client should send all the blob fragments in succession, and the server doesn't make any queries?

wolfpld avatar Dec 17 '24 12:12 wolfpld