UB FFM: Search results - GPU Array Access Auto-Tuning

GPU Array Access Auto-Tuning
Weber, Nicolas

E-resource

A fast integral image generation algorithm on GPUs
Dang, Qingqing ; Yan, Shengen ; Wu, Ren
2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS). Dec 2014, p624-631.

Conference

A highly efficient I/O-based out-of-core stencil algorithm with globally optimized temporal blocking
Midorikawa, Hiroko ; Tan, Hideyuki
2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM). Aug 2017, p1-6.

Conference

Directive-Based Data Partitioning and Pipelining and Auto-Tuning for High-Performance GPU Computing
Cui, Xuewen

E-resource

(De/Re)-Composition of Data-Parallel Computations via Multi-Dimensional Homomorphisms.
Rasch, Ari
ACM Transactions on Programming Languages & Systems. Sep2024, Vol. 46 Issue 3, p1-74. 74p.

Journal

Optimization Techniques for GPU Programming.
Hijma, Pieter ; Heldens, Stijn ; Sclocco, Alessio ; et al.
ACM Computing Surveys. Nov2023, Vol. 55 Issue 11, p1-81. 81p.

Journal

Umpalumpa: a framework for efficient execution of complex image processing workloads on heterogeneous nodes.
Střelák, David ; Myška, David ; Petrovič, Filip ; et al.
Computing. Nov2023, Vol. 105 Issue 11, p2389-2417. 29p.

Journal

Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations.
Wen, Mei ; Huang, Da-fei ; Xun, Chang-qing ; et al.
Frontiers of Information Technology & Electronic Engineering; Nov2015, Vol. 16 Issue 11, p899-916, 18p

Journal

Hardware Design of DRAM Memory Prefetching Engine for General-Purpose GPUs.
Gabbay, Freddy ; Salomon, Benjamin ; Golan, Idan ; et al.
Technologies (2227-7080); Oct2025, Vol. 13 Issue 10, p455, 31p

RANDOM access memory ; GRAPHICS processing unit... ; HARDWARE design & constr... ; MATHEMATICAL optimizatio... ; PATTERN perception ; COMPUTER memory manageme... ; …

Journal

A Low-latency On-chip Cache Hierarchy for Load-to-use Stall Reduction in GPUs.
Mahani, Negin (Sadat) (Nematollahi) ; Falahati, Hajar ; Darabi, Sina ; et al.
ACM Transactions on Architecture & Code Optimization; Sep2025, Vol. 22 Issue 3, p1-27, 27p

Journal

Preserving provability over GPU program optimizations with annotation-aware transformations.
Şakar, Ömer ; Safari, Mohsen ; Huisman, Marieke ; et al.
Formal Methods in System Design; Dec2025, Vol. 67 Issue 3, p316-372, 57p

PROGRAM transformation ; SOFTWARE verification ; OPTIMIZATION algorithms ; PARALLEL programs (Compu... ; SCIENTIFIC observation

Journal

Optimizing OpenCL Barrier Synchronization and Memory Efficiency on Multi-Core DSPs.
Gao, Wanrong ; Fang, Jianbin ; Zhang, Peng ; et al.
ACM Transactions on Architecture & Code Optimization; Dec2025, Vol. 22 Issue 4, p1-26, 26p

Journal

On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures.
Morgan, Nathaniel ; Yenusah, Caleb ; Diaz, Adrian ; et al.
Information; Nov2024, Vol. 15 Issue 11, p673, 24p

COMPUTER architecture ; PARALLEL computers ; MODERN architecture ; C++ ; FORTRAN

Journal

Optimizing General Sparse Matrix-Matrix Multiplication on the GPU.
Wang, Yizhuo ; Lin, Hongpeng ; Wei, Bingxin ; et al.
ACM Transactions on Architecture & Code Optimization; Dec2025, Vol. 22 Issue 4, p1-25, 25p

Journal

Deep learning data handling: exploring file formats and access strategies.
Parraga, Edixon ; Leon, Betzabeth ; Mendez, Sandra ; et al.
Cluster Computing; Oct2025, Vol. 28 Issue 9, p1-23, 23p

DEEP learning ; HIGH performance computi... ; INFORMATION retrieval ; ELECTRONIC data processi... ; DATA management ; DATA structures ; …

Journal

Cross-core Data Sharing for Energy-efficient GPUs.
Falahati, Hajar ; Sadrosadati, Mohammad ; Qiumin Xu ; et al.
ACM Transactions on Architecture & Code Optimization; Sep2024, Vol. 21 Issue 3, p1-32, 32p

Journal

DCSolver: Accelerating Sparse Iterative Solvers via Divide-and-Conquer on GPUs.
Haozhong Qiu ; Chuanfu Xu ; Jianbin Fang ; et al.
ACM Transactions on Architecture & Code Optimization; Sep2025, Vol. 22 Issue 3, p1-25, 25p

Journal

Research on Malodor Component Identification Based on Sensor Array.
Xie, Jiaxing ; Chen, Wen ; Chen, Shiyun ; et al.
Sensors (14248220); Jul2025, Vol. 25 Issue 13, p3857, 20p

SENSOR arrays ; ENVIRONMENTAL monitoring ; GAS detectors ; DATA analysis ; ELECTRONIC noses ; SMELL

Journal

I/O Access Patterns in HPC Applications: A 360-Degree Survey.
Bez, Jean Luca ; Byna, Suren ; Ibrahim, Shadi
ACM Computing Surveys. Feb2024, Vol. 56 Issue 2, p1-41. 41p.

Journal

SNCL: a supernode OpenCL implementation for hybrid computing arrays.
Tang, Tao ; Lu, Kai ; Peng, Lin ; et al.
Journal of Supercomputing; May2024, Vol. 80 Issue 7, p9471-9493, 23p

HETEROGENEOUS computing ; PARALLEL programming ; COMPUTER systems ; ENERGY consumption ; ARTIFICIAL intelligence ; SCALABILITY

Journal

Results 1 - 20 of 689