swapforth icon indicating copy to clipboard operation
swapforth copied to clipboard

AVX instructions for x64

Open postspectacular opened this issue 9 years ago • 1 comments
trafficstars

Hi James, I've been studying and successfully building the x64 version on OSX, but then realised that the code contains a few AVX instruction not supported on slightly older machines (e.g. my 2010 MBP i7 lacks AVX). Are these really needed or could they be conditionally be rewritten using only SSE2/3/4? I'll do some digging what these instruction exactly achieve and might create a PR (if I'm successful), but any guidance would be appreciated. Thanks!

postspectacular avatar May 28 '16 12:05 postspectacular

Aha, yes. If you do create a pull request would be appreciated. These instructions are all doing word matching in the dictionary inner loop. There is also some AVX instructions to do case-folding, so that the comparison is case-insentive.

Multiple SSE4 instructions could do that same work, I think.

Thanks! J.

jamesbowman avatar May 29 '16 23:05 jamesbowman