Skip to content

AOMP Release 16.0-3

Compare
Choose a tag to compare
@estewart08 estewart08 released this 08 Dec 23:25

These are the release notes for AOMP 16.0-3. This release uses modifications to the LLVM development trunk called the "amd-stg-open" branch. This is found at https://github.com/RadeonOpenCompute/llvm-project. The amd-stg-open branch is constantly changing as AMD merges upstream development trunk with its internal open development efforts. Some AMD modifications are experimental and/or under review for the LLVM upstream mono-repo. The AOMP release is a snapshot of amd-stg-open and supporting repositories to build various components.

For AOMP 16.0-3, the last trunk commit is 11e86868c1a1ee67a1d88ef84b68193d06dc996 on Nov 14, 2022. This is the 4th AOMP release for LLVM 16 development. The last amd-only commit is b642bb5cf84bbbdcc3e8748c5ceeb72c7bb07144 on Dec 2, 2022. This forms a frozen branch now called "aomp-16.0-3". See https://github.com/RadeonOpenCompute/llvm-project/tree/aomp-16.0-3.

AOMP is a "standalone" build of all necessary ROCm components with the exception of the kernel module and libdrm. The non llvm-project components for this release were built with ROCM 5.4.0 sources.

These are the changes from 16.0-2 to 16.0-3 include:

  • Build includes gfx90c, gfx1035, and gfx1036.
  • Fix to rocm_agent_enumerator to correctly identify gfx90c.
  • Fix issue #435 "abs undefined within device block #435".
  • More enhancements to xteam reductions .
  • Ignore map clause option with USM.
  • Additional support for OMPT functions "get_device_time" and "get_record_type".
  • NUM_QUEUES_PER_DEVICE default to 1.
  • Fixed clang-build-select-link to honor -fdisable-host-devmem.
  • Fixed openmp lib-debug build overwriting release libraries/plugins.
  • Updated cmake version to 3.22.1.
  • Added Ubuntu 22.04 package.

Errata:
(potential regressions from 16.0-2):

  • Smoke test failures:
    clang-337336 - Performance decrease, may cause test to timeout after 1 min. 16.0-2 showed 30-40 secs.

(potential regressions from 16.0-1):

  • Smoke test failures (issue at -O0):
    clang-ifaces: core dump (gfx908)
    clang-337336: core dump gfx908)
    clang-325070: core dump (gfx908)

(potential regressions from 16.0-0):

  • Performance decrease with lulesh
  • Performance decrease with Nekbone (performance improved in 16.0-3, but still not at 16.0-0 levels.)
  • Smoke test failures:
    flang-315870: (resolved by building this test case with cov5)
    managed_memory: segfault, when 2+ devices are present
  • Hip example failure:
    device-lib