Field G. Van Zee

Results 12 issues of Field G. Van Zee

This branch contains preliminary support for a new `.c_next` field within the `auxinfo_t` struct. It is fully implemented for `gemm`. Caveats: - The "wrap-around" address computation for the edge cases...

enhancement

Going forward, this PR branch will implement the changes set forth in #479. To start out with, only the new `configure` options are implemented. I'll incrementally update parts of the...

enhancement

As mentioned in [1], the goal is to allow BLIS to build and link on systems with minimal or nonexistent operating systems. This includes (but is not limited to) stripping...

enhancement

Provide functionality to pre-pack entire matrices (A or B), which can then be passed into the framework and computed upon while bypassing the traditional matrix packing stages. This will require...

enhancement

When setting parallelism the "automatic way" as described in the [Multithreading](https://github.com/flame/blis/blob/master/docs/Multithreading.md) documentation, the caller defers to BLIS to factorize the total number of threads into the number of ways of...

enhancement

The control tree can (in principle) be retrofitted with fields that can be used by threads to track timing information (within each loop and within packing functions) that can be...

enhancement

Read an environment variable, say, `BLIS_CPU_AFFINITY`, and use its contents to call `pthread_setaffinity_np()` to set the threads' affinity masks. Ideally, the same environment variable would control OpenMP thread affinity, in...

enhancement

A spin-off from issue #358: "BLIS should warn loudly when CPU type can't be detected." > > Are you suggesting outputting this message to stderr? It would take some rejiggering...

enhancement

For GNU gcc, this takes the form of the `-pg` option. Should be disabled by default.

enhancement

Let's discuss new operations that we might like to add to BLIS, specifically those that would fall into level-1v or level-1m families (and perhaps level-2): - [ ] element-wise vector/matrix...