Regarding the model's architecture, I have a question. CONCH seems to be the same as coca. Where exactly are their differences?