Showing 1 - 10 of 152 for search: '"H J, Kelly"'
Published in:
Geoscientific Model Development, Vol 15, Pp 3815-3829 (2022)
This paper proposes a new method that combines checkpointing methods with error-controlled lossy compression for large-scale high-performance full-waveform inversion (FWI), an inverse problem commonly used in geophysical exploration. This combination
External link:
https://doaj.org/article/317449046ad848a4b7c45d578e1fed9c
Authors:
Edward Stow, Paul H. J. Kelly
Published in:
Frontiers in Computer Science, Vol 4 (2022)
Many systems for image manipulation, signal analysis, machine learning, and scientific computing make use of discrete convolutional filters that are known before computation begins. These contexts benefit from common sub-expression elimination to red
External link:
https://doaj.org/article/1fd328d9fe254ea2a733370bb77800c8
Published in:
Concurrency and Computation: Practice and Experience.
We propose COMDETECTIVE+, an inter-thread communication analyzer, and REUSETRACKER+, a reuse distance analyzer, that leverage the hardware features in AMD processors to support low-overhead profiling. Both tools employ the instruction-based sampling
Published in:
IEEE Transactions on Visualization and Computer Graphics. 28:5178-5180
The topology of isosurfaces changes at isovalues of critical points, making such points an important feature when building contour trees or Morse-Smale complexes. Hexahedral elements with linear interpolants can contain additional off-vertex critical
Authors:
G.-T. Bercea, A. T. T. McRae, D. A. Ham, L. Mitchell, F. Rathgeber, L. Nardi, F. Luporini, P. H. J. Kelly
Published in:
Geoscientific Model Development, Vol 9, Iss 10, Pp 3803-3815 (2016)
We present a generic algorithm for numbering and then efficiently iterating over the data values attached to an extruded mesh. An extruded mesh is formed by replicating an existing mesh, assumed to be unstructured, to form layers of prismatic cells.
External link:
https://doaj.org/article/37c6927295694e98b35a709da8ac5176
Precise event sampling is a profiling feature in commodity processors that can sample hardware events and accurately locate the instructions that trigger the events. This feature has been used in a large number of tools to detect application performa
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b362637f06bcf0fbfeaac4874eff8ac4
http://hdl.handle.net/10044/1/103627
Authors:
Edward Stow, Abrar Ahsan, Yingying Li, Ali Babaei, Riku Murai, Sajad Saeedi, Paul H. J. Kelly
Focal-plane Sensor-processors (FPSPs) are a camera technology that enables low power, high frame rate computation in the image sensor itself, making them suitable for edge computation. To fit into the sensor array, FPSPs are highly resource-constrain
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::52d72e859c74448b1aae0ccb0b5a8cbe
http://hdl.handle.net/10044/1/99998
Published in:
Languages and Compilers for Parallel Computing ISBN: 9783030959524
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::59ec8615c7f68766727fe73aa9443511
https://doi.org/10.1007/978-3-030-95953-1_13
Published in:
FPL2021. The International Conference on Field-Programmable Logic and Applications (FPL)
This demo elaborates on the programmability aspect of Simodense, a recently released open-source softcore, optimised for evaluating custom SIMD instructions. CPUs featuring small reconfigurable areas for implementing custom instructions is an alterna
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6f5c7558bb461904b71cbe9b731aa3cb
http://hdl.handle.net/10044/1/90369
Published in:
FPL2021. The International Conference on Field-Programmable Logic and Applications (FPL)
Simodense is a high-performance open-source RISC-V (RV32IM) softcore, optimised for exploring custom SIMD instructions. In order to maximise SIMD instruction performance, the design’s memory system is optimised for streaming bandwidth, such as very
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a6b88e5a894f5bd9fbb096b0fb153a85
http://hdl.handle.net/10044/1/90081