Zobrazeno 1 - 10
of 11
pro vyhledávání: '"Samantika Subramaniam"'
Autor:
Joel Emer, Samantika Subramaniam, Tryggve Fossum, Aamer Jaleel, William C. Hasenplaugh, Simon C. Steely, Carl J. Beckmann
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 10:1-24
As microprocessor designs integrate more cores, scalability of cache coherence protocols becomes a challenging problem. Most directory-based protocols avoid races by using blocking tag directories that can impact the performance of parallel applicati
Publikováno v:
SBAC-PAD
The term "SQL-on-Hadoop" has recently gained significant traction [19]. Impala represents a new emerging class of SQL-on-Hadoop systems that exploit a shared-nothing parallel database architecture over Hadoop. Impala was designed to close the gap of
Publikováno v:
ASPLOS
When several applications are co-scheduled to run on a system with multiple shared LLCs, there is opportunity to improve system performance. This opportunity can be exploited by the hardware, software, or a combination of both hardware and software.
Autor:
Samantika Subramaniam, Gabriel H. Loh
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 6:1-33
Allowing loads that do not violate memory ordering to issue out of order with respect to earlier unresolved store addresses is very important for extracting parallelism in large-window superscalar processors. Previous research has proposed memory dep
Publikováno v:
ISPASS
For academic computer architecture research, a large number of publicly available simulators make use of relatively simple abstractions for the microarchitecture of the processor pipeline. For some types of studies, such as those for multi-core cache
Publikováno v:
HPCA
Some instructions have more impact on processor performance than others. Identification of these critical instructions can be used to modify and improve instruction processing. Previous work has shown that the criticality of instructions can be dynam
Publikováno v:
HPCA
Simultaneous multithreading (SMT) attempts to keep a dynamically scheduled processorpsilas resources busy with work from multiple independent threads. Threads with long-latency stalls, however, can lead to a reduction in overall throughput because th
Publikováno v:
2016 28th International Symposium on Computer Architecture & High Performance Computing (SBAC-PAD); 2016, p182-189, 8p
Autor:
Gabriel H. Loh, Samantika Subramaniam
Publikováno v:
MICRO
Modern processors use CAM-based load and store queues (LQ/SQ) to support out-of-order memory scheduling and store-to-load forwarding. However, the LQ and SQ scale poorly for the sizes required for large-window, high- ILP processors. Past research has
Autor:
Samantika Subramaniam, Gabriel H. Loh
Publikováno v:
2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06); 2006, p273-284, 12p