The embed output is 1.4T, which is too large to load into memory as a single array. Any tips for this?
You could try PCA to reduce the embedding dimension. See Figure 2 in Appendix A of this paper for an accuracy vs. dimension analysis.
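If it helps, here is a rough sketch of how PCA could be fit without ever loading the whole array. It is not the method from the paper, just one common out-of-core approach: stream the rows in chunks, accumulate the column sums and the Gram matrix, then eigendecompose the resulting covariance. It assumes the embeddings are stored on disk as a plain 2-D array that can be opened with `np.memmap`, and that the embedding dimension `d` is small enough for a `d x d` float64 matrix to fit in memory. The function names and the `chunk_rows` parameter are made up for illustration.

```python
import numpy as np


def fit_pca_chunked(X, n_components, chunk_rows=10_000):
    """Fit PCA on a 2-D array-like (e.g. an np.memmap) one chunk at a time.

    Returns (mean, components), where components has shape (d, n_components).
    """
    n, d = X.shape
    total = np.zeros(d, dtype=np.float64)      # running column sums
    gram = np.zeros((d, d), dtype=np.float64)  # running X^T X

    for start in range(0, n, chunk_rows):
        # Only this slice is read into memory.
        chunk = np.asarray(X[start:start + chunk_rows], dtype=np.float64)
        total += chunk.sum(axis=0)
        gram += chunk.T @ chunk

    mean = total / n
    # Sample covariance from the accumulated sums.
    cov = (gram - n * np.outer(mean, mean)) / (n - 1)

    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order]


def transform_chunked(X, mean, components, out, chunk_rows=10_000):
    """Project X onto the components chunk by chunk, writing into `out`."""
    n = X.shape[0]
    for start in range(0, n, chunk_rows):
        chunk = np.asarray(X[start:start + chunk_rows], dtype=np.float64)
        out[start:start + chunk_rows] = (chunk - mean) @ components
    return out
```

To use it, you would open the input with something like `np.memmap(path, dtype=np.float32, mode="r", shape=(n_rows, dim))`, create a second memmap with `mode="w+"` and shape `(n_rows, n_components)` for `out`, and pass both in. The path, dtype, and shape depend on how your embeddings were written, so treat those as placeholders. Subtracting the mean from the Gram matrix afterwards can lose some precision if the data has a large offset, which is why everything is accumulated in float64.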