Abstract
High-resolution simulations of estuarine saltwater intrusion are computationally demanding and require efficient execution on heterogeneous computing platforms. In this study, the use of standard Fortran parallelization—DO CONCURRENT—to accelerate the unstructured quadrilateral grid finite-differencing estuarine and coastal ocean model (UFDECOM-i) within a unified codebase for both multicore CPUs and GPUs was investigated. Using the NVFORTRAN compiler, three versions were implemented: MC-UFDECOM-i on multicore CPUs, GPU-UFDECOM-i using automatic data migration, and GPUA-UFDECOM-i using lightweight OpenACC directives for explicit data management. The results show that DO CONCURRENT enables scalable shared-memory parallelism on CPUs, with speedups of up to 16.32 ×, and provides functional portability to GPUs without code modification. However, optimal GPU performance requires explicit data management, with GPUA-UFDECOM-i reaching a maximum speedup of 21.48 × . These results demonstrate that DO CONCURRENT ensures portability and maintainability, whereas explicit data control remains essential for high GPU efficiency.
| Original language | English |
|---|---|
| Article number | 106911 |
| Journal | Environmental Modelling and Software |
| Volume | 198 |
| DOIs | |
| State | Published - Mar 2026 |
Keywords
- Computational efficiency
- Fortran DO CONCURRENT
- GPU parallel computing
- High portability
- Saltwater intrusion model
Fingerprint
Dive into the research topics of 'Parallelization of the estuarine saltwater intrusion numerical forecast model UFDECOM-i using Fortran DO CONCURRENT'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver