HiPC 2015 Accepted Papers
Throughput Regulation in Shared Memory Multicore Processors
Application Taxonomy via Algorithmic Commonality for Domain-specific Architecture Design
FlexCore: A Reconfigurable Processor Supporting Flexible, Dynamic Morphing
High Efficiency Generalized Parallel Counters for Xilinx FPGAs
2QW-Clock: An efficient SSD buffer management algorithm
Task-based multifrontal QR solver for GPU-accelerated multicore architectures
Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices
On the resilience of parallel sparse hybrid solvers
New Tridiagonal Systems Solvers on GPU architectures
A Stable Parallel Algorithm for Diagonally Dominant Tridiagonal Linear Systems
Optimizing Approximate Weighted Matching on Nvidia Kepler K40
Improving Communication Throughput by Multipath Load Balancing on Blue Gene/Q
Dynamic Adaptation for Elastic System Services using Virtual Servers
Understanding the Benefits of Asynchronous Data Transfers in Media Processors
Hardware-Transactional-Memory Based Speculative Parallel Discrete Event Simulation of Very Fine Grain Models
Towards Practical Page Placement for a Green Memory Manager
Efficient Barrier Implementation on the POWER8 Processor
On Accelerating Concurrent PCA Computations for Financial Risk Applications
A Performance Model for GPU-Accelerated FDTD Applications
Vectorized Big Integer Operations for Cryptosystems on Intel MIC Platform
Characterizing Large Dataset GPU Compute Workloads Targeting Systems with Die-Stacked Memory
A GPU-based MIS Aggregation Strategy
High Throughput Hierarchical Heavy Hitter Detection in Data Streams
Offloaded GPU Collectives using CORE-Direct and CUDA Capabilities on IB Clusters
High Performance OpenSHMEM Strided Communication Support with InfiniBand UMR
On the Use of Commodity Ethernet Technology in Exascale HPC Systems
Trigeneous Platforms for Energy Efficient Computing of HPC Applications
ColdBus: A Near-Optimal Power Efficient Optical Bus
A Simple BSP-based Model to Predict Execution Time in GPU Applications
Partition with side effects
Geographically Distributed Load Balancing with (Almost) Arbitrary Load Functions
Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU
Accelerating Complex Event Processing through GPUs
Efficient Batched Predecessor Search in Shared Memory on GPUs
Strategies of SIMD computing for image coding in GPU
IC-Data: Improving Compressed Data Processing in Hadoop
Dominoes: Speculative Repair in Erasure Coded Hadoop System
Collective Offload for Heterogeneous Clusters
Meta-scheduling of HPC Jobs in Day-Ahead Electricity Markets
Load Balancing and Accelerating Spatial Join Operations using Bitmap Indexing
Algorithm Level Fault Tolerance for Molecular Dynamic Applications
V-PFORDelta: Data Compression for Energy Efficient Computation of Time Series
Holistic Management of Sustainable Geo-Distributed Data Centers
Parallel Megabase DNA Sequence Comparison with OpenCL
Parallel Read Error Correction for Big Genomic Datasets
High Performance Front Camera ADAS Applications on TI's TDA3X Platform
Information Theory Based Genome-scale Gene Networks Construction using MapReduce