Is it chosen based on empirical observations?
We followed the original calculation paradigm of CLIP here. Please refer to open_clip for more details.