[WIP] MAHOUT-1974 (dense CUDA multiplication)
Purpose of PR:
Please give a short description of what this PR is for.
Important ToDos
Please mark each with an "x"
- [x] A JIRA ticket exists ([if not, please create this first](https://issues.apache.org/jira/browse/MAHOUT/))
- [x] Title of PR is "MAHOUT-XXXX Brief Description of Changes" where XXXX is the JIRA number.
- [x] Created unit tests where appropriate
- [ ] Added licenses correct on newly added files
- [ ] Assigned JIRA to self
- [ ] Added documentation in scala docs/java docs, and to website
- [ ] Successfully built and ran all unit tests, verified that all tests pass locally.
If not all of these things are complete, but you still feel it is appropriate to open a PR, please add [WIP] after MAHOUT-XXXX and before the description, e.g. "MAHOUT-XXXX [WIP] Description of Change"
Does this change break earlier versions?
Is this the beginning of a larger project for which a feature branch should be made?
need to rebase
@nsakharnykh @pat @rawkintrevo FYI: sparse-sparse values are correct; dense-dense is implemented but untested.
@nsakharnykh sorry for the state of this branch. I tend to commit a lot on this project and leave a lot of [WIP]s in when jumping around to other branches. Resolved all conflicts against the CUDA branch. Going to start looking at Dense %*% sparse again; first I need to read up on `JCublas2._`.
@nsakharnykh I think I'll fix the routing on this and test the sorted CSC requirement (MAHOUT-2004 and MAHOUT-2003). Then, after some cleanup, we should push to mahout/CUDA.