| The future of scientific workflows | 11 |
| Big data and extreme-scale computing: Pathways to Convergence-Toward a shaping strategy for a future software and data ecosystem for scientific inquiry | 7 |
| Optimization of lattice Boltzmann simulations on heterogeneous computers | 6 |
| Scalability study of an implicit solver for coupled fluid-structure interaction problems on unstructured meshes in 3D | 5 |
| Angara interconnect makes GPU-based Desmos supercomputer an efficient tool for molecular dynamics calculations | 5 |
| Anatomy of machine learning algorithm implementations in MPI, Spark, and Flink | 4 |
| Task-based programming in COMPSs to converge from HPC to big data | 4 |
| TweTriS: Twenty trillion-atom simulation | 4 |
| Communication analysis and optimization of 3D front tracking method for multiphase flow simulations | 4 |
| Acceleration of tensor-product operations for high-order finite element methods | 4 |
| METADOCK: A parallel metaheuristic schema for virtual screening methods | 4 |
| Optimizing the performance of reactive molecular dynamics simulations for many-core architectures | 3 |
| Finding parallel patterns through static analysis in C++ applications | 3 |
| Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors | 3 |
| An efficient MPI/OpenMP parallelization of the Hartree-Fock-Roothaan method for the first generation of Intel (R) Xeon Phi (TM) processor architecture | 3 |
| Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver | 3 |
| Topology-aware job mapping | 2 |
| Reducing the energy consumption of large-scale computing systems through combined shutdown policies with multiple constraints | 2 |
| Efficient model of tumor dynamics simulated in multi-GPU environment | 2 |
| SWIRL: High-performance many-core CPU code generation for deep neural networks | 2 |
| Use cases of lossy compression for floating-point data in scientific data sets | 2 |
| Computational reproducibility of scientific workflows at extreme scales | 2 |
| Hierarchical approach for deriving a reproducible unblocked LU factorization | 2 |
| A massively parallel semi-Lagrangian solver for the six-dimensional Vlasov-Poisson equation | 2 |
| A scalable and extensible checkpointing scheme for massively parallel simulations | 2 |
| High-performance epistasis detection in quantitative trait GWAS | 2 |
| Pricing schemes for energy-efficient HPC systems: Design and exploration | 2 |
| The role of machine learning in scientific workflows | 2 |
| Performance analysis of fully explicit and fully implicit solvers within a spectral element shallow-water atmosphere model | 2 |
| Acceleration of the IMplicit-EXplicit nonhydrostatic unified model of the atmosphere on manycore processors | 2 |
| Toward performance portability of the Albany finite element analysis code using the Kokkos library | 2 |
| Strong scaling for numerical weather prediction at petascale with the atmospheric model NUMA | 2 |
| Beyond spatial scalability limitations with a massively parallel method for linear oscillatory problems | 2 |
| Porting and optimization of solidification application for CPU-MIC hybrid platforms | 2 |
| A fast massively parallel two-phase flow solver for microfluidic chip simulation | 1 |
| Fine-grained floating-point precision analysis | 1 |
| MiniApps derived from production HPC applications using multiple programing models | 1 |
| An improved parallelism scheme for deterministic discrete ordinates transport | 1 |
| Modeling and simulations of broad-area edge-emitting semiconductor devices | 1 |
| GPU-based computational modeling of magnetic resonance imaging of vascular structures | 1 |
| Deploying massive runs of evolutionary algorithms with ECJ and Hadoop: Reducing interest points required for face recognition | 1 |
| Soft fault detection and correction for multigrid | 1 |
| Partial differential equations preconditioner resilient to soft and hard faults | 1 |
| Towards cloud-based parallel metaheuristics: A case study in computational biology with Differential Evolution and Spark | 1 |
| Leveraging the accelerated processing units for seismic imaging: A performance and power efficiency comparison against CPUs and GPUs | 1 |
| Exploring the feasibility of lossy compression for PDE simulations | 1 |
| The dividends of investing in computational software design: A case study | 1 |
| Dynamic reconfiguration of noniterative scientific applications: A case study with HPG aligner | 1 |
| A massively scalable distributed multigrid framework for nonlinear marine hydrodynamics | 1 |
| Wind farm simulations using an overset hp-adaptive approach with blade-resolved turbine models | 1 |