| Multithreaded sparse matrix-matrix multiplication for ma... |
8 |
| Optimizations of the eigensolvers in the ELPA library |
7 |
| Batched QR and SVD algorithms on GPUs with applications ... |
7 |
| DVFS-aware application classification to improve GPGPUs ... |
5 |
| Accelerating the SVD two stage bidiagonal reduction and ... |
4 |
| Comparing load-balancing algorithms for MapReduce under ... |
4 |
| Proteus: Exploiting precision variability in deep neural... |
3 |
| SAGE: Percipient Storage for Exascale Data Centric Compu... |
3 |
| Manila: Using a densely populated PMC-space for power mo... |
3 |
| Performance optimization, modeling and analysis of spars... |
3 |