Search results - "Hari Subramoni"

System-Level Scalable Checkpoint-Restart for Petascale Computing

Authors: Jerome Vienne, Kapil Arya, Gene Cooperman, Shawn Matott, Jiajun Cao, Dhabaleswar K. Panda, Rohan Garg, Hari Subramoni

Published in: ICPADS

Fault tolerance for the upcoming exascale generation has long been an area of active research. One of the components of a fault tolerance strategy is checkpointing. Petascale-level checkpointing is demonstrated through a new mechanism for virtualizat

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4212e2bfb2230f760281b2394efcdb06
https://doi.org/10.1109/icpads.2016.0125

Impact of HPC Cloud Networking Technologies on Accelerating Hadoop RPC and HBase

Authors: Hari Subramoni, Xiaoyi Lu, Shashank Gugnani, Dipti Shankar, Dhabaleswar K. Panda

Published in: CloudCom

The performance of Hadoop components can be significantly improved by leveraging advanced features such as Remote Direct Memory Access (RDMA) on modern HPC clusters, where high-performance networks like InfiniBand (IB) and RoCE have been deployed wid

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::632976cee583d6902b74bffc3f4a3998
https://doi.org/10.1109/cloudcom.2016.0057

Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits

Authors: Mingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Hari Subramoni, Jie Zhang, Dhabaleswar K. Panda

Published in: SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::d6b2a6e2feab3f4227d8ea63bb980878
https://doi.org/10.1109/sc.2016.36

Designing High Performance Heterogeneous Broadcast for Streaming Applications on GPU Clusters

Authors: Ching-Hsiang Chu, Dhabaleswar K. Panda, Akshay Venkatesh, Hari Subramoni, Bracy Elton, Khaled Hamidouche

Published in: SBAC-PAD

High-performance streaming applications are beginning to leverage the compute power offered by graphics processing units (GPUs) and high network throughput offered by high performance interconnects such as InfiniBand (IB) to boost their performance a

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::09e1850ae43387f5edf0f562fb0cb0a0
https://doi.org/10.1109/sbac-pad.2016.16

Adaptive and Dynamic Design for MPI Tag Matching

Authors: Mohammadreza Bayatpour, Dhabaleswar K. Panda, Hari Subramoni, Sourav Chakraborty

Published in: CLUSTER

The Message Passing Interface (MPI) standard specifies the use of (source, tag, communicator) tuple to identify whether an incoming message is what the receiver process is expecting. The cost associated with this process, commonly known as "tag match

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::fb9ff5a93ba962dadb24d51897f616dd
https://doi.org/10.1109/cluster.2016.69

SHMEMPMI -- Shared Memory Based PMI for Improved Performance and Scalability

Authors: Dhabaleswar K. Panda, Hari Subramoni, Sourav Chakraborty, Jonathan Perkins

Published in: CCGrid

Dense systems with large number of cores per node are becoming increasingly popular. Existing designs of the Process Management Interface (PMI) show poor scalability in terms of performance and memory consumption on such systems with large number of

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::3b6068a2405329322653c701b4c3ec0b
https://doi.org/10.1109/ccgrid.2016.99

Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-Enabled Systems

Authors: Hari Subramoni, Dip Sankar Banerjee, Khaled Hamidouche, Dhabaleswar K. Panda, Akshay Venkatesh, C-H. Chu

Published in: IPDPS

GPU accelerators are widely used in HPC clusters due to their massive parallelism and high throughput-per-watt. Data movement continues to be the major bottleneck on GPU clusters, more so when data is non-contiguous, which is common in scientific app

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::da07a0cc8cc3f42303e9b83904afb02e
https://doi.org/10.1109/ipdps.2016.99

INAM2: InfiniBand Network Analysis and Monitoring with MPI

Authors: Dhabaleswar K. Panda, Albert Mathews Augustine, Mark Arnold, Hari Subramoni, Khaled Hamidouche, Xiaoyi Lu, Jonathan Perkins

Published in: Lecture Notes in Computer Science, ISBN: 9783319413204

Modern high-end computing is being driven by the tight integration of several hardware and software components. On the hardware front, there are the multi-/many-core architectures (including accelerators and co-processors) and high-end interconnects

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::d5aae3204cbf4490c876b8838bc0aac0
https://doi.org/10.1007/978-3-319-41321-1_16

GPU-Aware Design, Implementation, and Evaluation of Non-blocking Collective Benchmarks

Authors: Hari Subramoni, Jonathan Perkins, Dhabaleswar K. Panda, Khaled Hamidouche, Ammar Ahmad Awan, Akshay Venkatesh

Published in: EuroMPI

As we move towards efficient exascale systems, heterogeneous accelerators like NVIDIA GPUs are becoming a significant compute component of modern HPC clusters. It has become important to utilize every single cycle of every compute device available in

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::64c7efaf739f3779f63b940e0a85a5c7
https://doi.org/10.1145/2802658.2802672

High Performance MPI Datatype Support with User-Mode Memory Registration: Challenges, Designs, and Benefits

Authors: Dhabaleswar K. Panda, Hari Subramoni, Khaled Hamidouche, Mingzhe Li, Xiaoyi Lu

Published in: CLUSTER

Noncontiguous data communication has been heavily adopted in scientific applications, especially for those written with MPI. Common strategies to handle noncontiguous data, like packing/unpacking, incur significant performance overhead during communi

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::98527282d96a680ab77b5c8c06650639
https://doi.org/10.1109/cluster.2015.41
